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Description 

Background 

The present invention relates generally to the manipulation of genetic materials and, more particularly, 
s to recombinant procedures making possible the production of polypeptides possessing part or all of the 
primary structural conformation. 



A. Manipulation Of Genetic Materials 

Genetic materials may be broadly defined as those chemical substances which program for and guide 

to the manufacture of constituents of cells and viruses and direct the responses of cells and viruses. A long 
chain polymeric substance known as deoxyribonucleic acid (DNA) comprises the genetic material of all 
living cells and viruses except for certain viruses which are programmed by ribonucleic acids (RNA). The 
repeating units in DNA polymers are four different nucleotides, each of which consists of either a purine 
(adenine or guanine) or a pyrimidine (thymine or cytosine) bound to a deoxyribose sugar to which a 

15 phosphate group is attached. Attachment of nucleotides in linear polymeric form is by means of fusion of 
the 5' phosphate of one nucleotide to the 3' hydroxyl group of another. Functional DNA occurs in the form 
of stable double stranded associations of single strands of nucleotides (known as deoxyoligonucleotides), 
which associations occur by means of hydrogen bonding between purine and pyrimidine bases [i.e., 
"complementary" associations existing between adenine (A) and thymine (T) or guanine (G) and cytosine 

20 (C)]. By convention, nucleotides are referred to by the names of their constituent purine or pyrimidine 
bases, and the complementary associations of nucleotides in double stranded DNA (i.e., A— T and G — C) 
are referred to as "base pairs". Ribonucleic acid is a polynucleotide comprising adenine, guanine, cytosine 
and uracil <U), rather than thymine, bound to ribose and a phosphate group. 

Most briefly put, the programming function of DNA is generally effected through a process wherein 

25 specific DNA nucleotide sequences (genes) are "transcribed" into relatively unstable messenger RNA 
(mRNA) polymers. The mRNA, in turn, serves as a template for the formation of structural, regulatory and 
catalytic proteins from amino adds. This mRNA "translation" process involves the operations of small RNA 
strands (tRNA) which transport and align individual amino acids along the mRNA strand to allow for 
formation of polypeptides in proper amino acid sequences. The mRNA "message", derived from DNA and 

30 providing the basis for the tRNA supply and orientation of any given one of the twenty amino acids for 
polypeptide "expression", is in the form of triplet "codons" — sequential groupings of three nucleotide 
bases. In one sense, the formation of a protein is the ultimate form of "expression" of the programmed 
genetic message provided by the nucleotide sequence of a gene. 

"Promoter" DNA sequences usually "precede" a gene in a DNA polymer and provide a site for 

35 initiation of the transcription into mRNA. "Regulator" DNA sequences, also usually "upstream" of (i.e., 
preceding) a gene in a given DNA polymer, bind proteins that determine the frequency (or rate) of 
transcriptional initiation. Collectively referred to as "promoter/regulator" or "control" DNA sequence, 
these sequences which precede a selected gene (or series of genes) in a functional DNA polymer cooperate 
to determine whether the transcription (and eventual expression) of a gene will occur. DNA sequences 

40 which "follow" a gene in a DNA polymer and provide a signal for termination of the transcription into 
mRNA are referred to as transcription "terminator" sequences. 

A focus of microbiological processing for the last decade has' been the attempt to manufacture 
industrially and pharmaceutical^ significant substances using organisms which either do not initially have 
genetically coded information concerning the desired product included in their DNA, or (in the case of 

45 mammalian cells in culture) do not ordinarily express a chromosomal gene at appreciable levels. Simply 
put, a gene that specifies the structure of a desired polypeptide product is either isolated from a "donor" 
organism or chemically synthesized and then stably introduced into another organism which is preferably 
a self-replicating unicellular organism such as bacteria, yeast or mammalian cells in culture. Once this is 
done, the existing machinery for gene expression in the "transformed" or "transfected" microbial host 

so cells operates to construct the desired product, using the exogenous DNA as a template for transcription of 
mRNA which is then translated into a continuous sequence of amino acid residues. 

The art is rich in patent and literature publications relating to "recombinant DNA" methodologies for 
the isolation, synthesis, purification and amplification of genetic materials for use in the transformation of 
selected host organisms. U.S. Letters Patent No. 4,237,224 to Cohen, et al., for example, relates to 

55 transformation of unicellular host organisms with "hybrid" viral or circular plasmid DNA which includes 
selected exogenous DNA sequences. The procedures of the Cohen, et al. patent first involve manufacture 
of a transformation vector by enzymatically cleaving viral or circular plasmid DNA to form linear DNA 
strands. Selected foreign ("exogenous" or "heterologous") DNA strands usually including sequences 
coding for desired product are prepared in linear form through use of a similar enzymes. The linear viral or 

60 plasmid DNA is incubated with the foreign DNA in the presence of ligating enzymes capable of effecting a 
restoration process and "hybrid" vectors are formed which include the selected exogenous DNA segment 
"spliced" into the viral or circular DNA plasmid. 

Transformation of compatible unicellular host organisms with the hybrid vector results in the 
formation of multiple copies of the exogenous DNA in the host cell population. In some instances, the 

55. desired result is simply the amplification of the foreign DNA and the "product" harvested is DNA. More 
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frequently, the goal of transformation is the expression by the host cells of the exogenous DNA in the form 
of large scale synthesis of isolatable quantities of commercially significant protein or polypeptide 
fragments coded for by the foreign DNA. See also, e.g., U.S. Letters Patent Nos. 4,264,731 (to Shine), 
4,273,875 (to Manis), 4,293,652 (to Cohen), and European Patent Application 093,619, published November 
5 9, 1983. 

The development of specific DNA sequences for splicing into DNA vectors is accomplished by a variety 
of techniques, depending to a great deal on the degree of "foreignness" of the "donor" to the projected 
host and the size of the polypeptide to be expressed in the host At the risk of over-simplification, it can be 
stated that three alternative principal methods can be employed: (1) the "isolation" of double-stranded 
io DNA sequence from the genomic DNA of the donor; (2) the chemical manufacture of a DNA sequence 
providing a code for a polypeptide of interest; and (3) the in vitro synthesis of a double-stranded DNA 
sequence by enzymatic "reverse transcription" of mRNA isolated from donor cells. The last-mentioned 
methods which involve formation of a DNA "complement" of mRNA are generally referred to as "cDNA" 
methods. 

15 Manufacture of DNA sequences is frequently the method of choice when the entire sequence of amino 
acid residues of the desired polypeptide product is known. DNA manufacturing procedures of co-owned, 
co-pending U.S. Patent Application Serial No. 483,451, by Alton, et al., (filed April 15, 1983 and 
corresponding to PCT US83;00605, published November 25, 1983 as WO83/04053), for example, provide a 
superior means for accomplishing such highly desirable results as: providing for the presence of alternate 

20 codons commonly found in genes which are highly expressed in the host organism selected for expression 
(e.g., providing yest or £ colt "preference" codons); avoiding the presence of untranslated "intron" 
sequences (commonly present in mammalian genomic DNA sequences and mRNA transcripts thereof) 
which are not readily processed by procaryotic host cells; avoiding expression of undesired "leader" 
polypeptide sequences commonly coded for by genomic DNA and cDNA sequences but frequently not 

25 readily cleaved from the polypeptide of interest by bacterial or yeast host cells; providing for ready 
insertion of the DNA in convenient expression vectors in association with desired promotor/regulator and 
terminator sequences; and providing for ready construction of genes for polypeptide fragments and 
analogs of the desired polypeptides. 

When the entire sequence of amino acid residues of the desired polypeptide is not known, direct 

30 manufacture of DNA sequences is not possible and isolation of DNA sequences coding for the polypeptide 
by a cDNA method becomes the method of choice despite the potential drawbacks in ease of assembly of 
expression vectors capable of providing high levels of microbial expression referred to above. Among the 
standard procedures for isolating cDNA sequences of interest is the preparation of plasmid-borne cDNA 
"libraries" derived from reverse transcription of mRNA abundant in donor cells selected as responsible for 

35 high level expression of genes (e.g., libraries of cDNA derived from pituitary cells which express relatively 
large quantities of growth hormone products). Where substantial portions of the polypeptide's amino acid 
sequence are known, labelled, single-stranded DNA probe sequences duplicating a sequence putatively 
present in the "target" cDNA may be employed in DNA/DNA hybridization procedures carried out on 
cloned copies of the cDNA which have been denatured to single stranded form. [See, generally, the 

40 disclosure and discussions of the art provided in U.S. Patent No. 4,394,443 to Weissman, et al. and the 
recent demonstrations of the use of long oligonucleotide hybridization probes reported in Wallace, et al., 
Nuc. Acids Res., 6, pp. 3543—3557 (1979), and Reyes, et al., P.N.A.S. (U.SA.f, 79, pp. 3270—3274 (1982), 
and Jaye, et aL, Nuc. Acids Res., 1 1, pp. 2325—2335 (1 983). See also, U.S. Patent No. 4,358,535 to Falkow, et 
al., relating to DNA/DNA hybridization procedures in effecting diagnosis; published European Patent 

45 Application Nos. 0070685 and 0070687 relating to light-emitting labels on single stranded polynucleotide 
probes; Davis, et aL, "A Manual for Genetic Engineering, Advanced Bacterial Genetics", Cold Spring 
Harbor Laboratory, Cold Spring Harbor, N.Y. (1980) at pp. 55—58 and 174—176, relating to colony and 
plaque hybridization techniques; and, New England Nuclear (Boston, Mass.) brochures for "Gene Screen" 
Hybridization Transfer Membrane materials providing instruction manuals for the transfer and 

so hybridization of DNA and RNA, Catalog No. NEF-972.] 

Among the more significant recent advances in hybridization procedures for the screening of 
recombinant clones is the use of labelled mixed synthetic oligonucleotide probes, each of which is 
potentially the complete complement of a specific DNA sequence in the hybridization sample including a 
heterogenous mixture of single stranded DNAs and RNAs. These procedures are acknowledged to be 

55 especially useful in the detection of cDNA clones derived from sources which provide extremely low 
amounts of mRNA sequences for the polypeptide of interest. Briefly put, use of stringent hybridization 
conditions directed toward avoidance of non-specific binding can allow, e.g., for the autoradiographic 
visualization of a specific cDN A clone upon the event of hybridization of the target DNA to that single probe 
within the mixture which is its complete complement. See generally, Wallace, et aL, Nuc. Acids Res., 9, pp. 

60 879—897 (1981); Suggs, et al. P.N.A.S. (U.S.A.), 78, pp. 6613-6617 (1981); Choo, et al.. Nature, 299, pp. 
178-180 (1982); Kurachi, et aL, P.NA.S. {U.S. A), 79, pp. 6461—6464 (1982); Ohkubo, et aL, P.NA.S. 
(U.S.A.), 80, pp. 2196—2200 (1983); and Kornblihtt, et aL, P.N.A.S. (U.SA.), 80, pp. 3218-3222 (1983). In 
general, the mixed probe procedures of Wallace, et al. (1981), supra, have been expanded upon by various 
workers to the point where reliable results have reportedly been obtained in a cDNA clone isolation using a 

6s 32 member mixed "pool" of 16-base-long (16-mer) oligonucleotide probes of uniformly, varying DNA 
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sequences together with a single 11-mer to effect a two-site "positive" confirmation of the presence of 
cDNA of interest. See, Singer-Sam, et al., P.N.A.S. (U.S.A.), 80, pp. 802—806 (1983). 

The use of genomic DNA isolates is the least common of the three above-noted methods for 
developing specific DNA sequences for use in recombinant procedures. This is especially true in the area of 

5 recombinant procedures directed to securing microbial expression of mammalian polypeptides and is due, 
principally to the complexity of mammalian genomic DNA. Thus, while reliable procedures exist for 
developing phage-borne libraries of genomic DNA of human and other mammalian species origins [See, 
e.g., Lawn, et al. Cell, IB, pp. 1157—1174 (1978) relating to procedures for generating a human genomic 
library commonly referred to as the "Maniatis Library"; Karn, et al., P.NA.S. (U.SA.f, 77, pp 5172 — 5176 

io (1980) relating to a human genomic library based on alternative restriction endonuciease fragmentation 
procedure; and Blattner, et al., Science, 196, pp. 161 — 169 (1977) describing construction of a bovine 
genomic library] there have been relatively few successful attempts at use of hybridization procedures in 
isolating genomic DNA in the absence of extensive foreknowledge of amino acid or DNA sequences. As 
one example, Fiddes, et al., et a/., J. MoL and App. Genetics, 1, pp. 3 — 18 (1981) report the successful 

is isolation of a gene coding for the alpha subunit of the human pituitary glycoprotein hormones from the 
Maniatis Library through use of a "full length" probe including a complete 621 base pair fragment of a 
previously-isolated cDNA sequence for the alpha subunit. As another example, Das, et al., P.N.A.S. (U.S.A.), 
80, pp. 1531-1535 (1983) report isolation of human genomic clones for human HLA— DR using a 175 base 
pair synthetic oligonucleotide. Finally, Anderson, et al., PMA.S. (U.S.A.), 80, pp. 6838—6842 (1983) report 

20 the isolation of genomic clone for bovine pancreatic trypsin inhibitor (BPTI) using a single probe 86 base 
pairs in length and constructed according to the known amino acid sequence of BPTI. The authors note a 
determination of poor prospects for isolating mRNA suitable for synethesis of a cDNA library due to 
apparent low levels of mRNA in initially targeted parotid gland and lung tissue sources and then address 
the prospects of success in probing a genomic library using a mixture of labelled probes, stating: "More 

25 generally, mixed-sequence oligodeoxynucleotide probes have been used to isolate protein genes of 
unknown sequence from cDNA libraries. Such probes are typically mixtures of 8 — 32 oligonucleotides, 
14 — 17 nucleotides in length, representing every possible codon combination for a small stretch (5—6 
residues) of amino acid sequence. Under stringent hybridization conditions that discriminate against 
incorrectly base-paired probes, these mixtures are capable of locating specific gene sequences in clone 

30 libraries of low-to-moderate complexity. Nevertheless, because of their short length and heterogeneity, 
mixed probes often lack the specificity required for probing sequences as complex as a mammalian 
genome. This makes such a method impractical for the isolation of mammalian protein genes when the 
corresponding mRNAs are unavailable." (Citations omitted). 

There thus* continues to exist a need in the art for improved methods for effecting the rapid and 

35 efficient isolation of cDNA clones in instances where little is known of the amino acid sequence of the 
polypeptide coded for and where "enriched" tissue sources of mRNA are not readily available for use in 
constructing cDNA libraries. Such improved methods would be especially useful if they were applicable to 
isolating mammalian genomic clones where sparse information is available concerning amino acid 
sequences of the polypeptide coded for by the gene sought 

40 

B. Erythropoietin As A Polypeptide Of Interest 

Erythropoiesis, the production of red blood cells, occurs continuously throughout the human life span 
to offset cell destruction. Erythropoiesis is a very precisely controlled physiological mechanism enabling 
sufficient numbers of red blood cells to be available in the blood for proper tissue oxygenation, but not so 

45 many that the cells would impede circulation. The formation of red blood cells occurs in the bone marrow 
and is under the control of the hormone, erythropoietin. 

Erythropoietin, an acidic glycoprotein of approximately 34,000 dalton molcular weight, may occur in 
three forms: a, (3 and asialo. The a and £ forms differ slightly in carbohydrate components, but have the 
same potency, biological activity and molecular weight. The asialo form is an a and (3 form with the 

so terminal carbohydrate (sialic acid) removed. Erythropoietin is present in very low concentrations in plasma 
when the body is in a healthy state wherein tissues receive sufficient oxygenation from the existing number 
of erythrocytes. This normal low concentration is enough to stimulate replacement of red blood cells which 
are lost normally through aging. 

The amount of erythropoietin in the circulation is increased under conditions of hypoxia when oxygen 

55 transport by blood cells in the circulation is reduced. Hypoxia may be caused by loss of large amounts of 
blood through hemorrhage, destruction of red blood cells by over-exposure to radiation, reduction in 
oxygen intake due to high altitudes or prolonged unconsciousness, or various forms of anemia. In 
response to tissues undergoing hypoxic stress, erythropoietin will increase red blood cell production by 
stimulating the conversion of primitive precursor cells in the bone marrow into proerythroblasts which 

so subsequently mature, synthesize hemoglobin and are released into the circulation as red blood cells. When 
the number of red blood cells in circulation is greater than needed for normal tissue oxygen requirements, 
erythropoietin in circulation is decreased. 

See generally, Testa, et al., Exp. HematoL, 8(Supp: 8}, 144—152 (1980); Tong, et al., J. Biol. Chem., 
256(24), 12666—12672 (1981); Goldwasser, J. Cell. Physiol, 110 (Supp. 11, 133—135 (1982); Finch, Blood, 

65 60(6), 1241—1246 (1982); Sytowski, et al., Expt HematoL, 8(Supp 8), 52—64 (1980); Naughton, Ann. Clin. 

5 
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Lab. ScL, t3(5), 432—438 (1983); Weiss, et al., Am. J. Vet Res., 44(10), 1832—1835 (1983); Lappin, et aL, 
Exp. Hematot., 11(7), 661—666 (1983); Baciu, et al,Ann. N. Y. Acad. Set'., 414, 66—72 (1983); Murphy, et aL, 
Acta. Haematologica Japonica, 46(7), 1380—1396 (1983); Dessypris, et al., Brit. J. Haematol., 56, 295—306 
(1984); and, Emmanouel, et al., Am. J. Physiol. 247 (1 Pt 2), F168—76 (1984), 

5 Because erythropoietin is essential in the process of red blood cell formation, the hormone has 

potential useful application in both the diagnosis and the treatment of blood disorders characterized by low 
or defective red blood cell production. See, generally, Pennathur-Das, et al., Blood, 63(5), 1168—71 (1984) 
and Haddy, Am. Jour. Ped. Hematol./OncoL, 4, 191—196, (1982) relating to erythropoietin in possible 
therapies for sickle cell disease, and Eschbach, et al. J. Clin. Invest, 74(2), pp. 434—441 , (1 984), describing a 

io therapeutic regimen for uremic sheep based on in vivo response to erythropoietin-rich plasma infusions 
and proposing a dosage of 10 U EPO/kg per day for 15—40 days as corrective of anemia of the type 
associated with chronic renal failure. See also, Krane, Henry Ford Hops. Med. J., 31(3), 177—181 (1983). 

It has recently been estimated that the availability of erythropoietin in quantity would allow for 
treatment each year of anemias of 1,600,000 persons in the United States alone. See, e.g., Morrison, 

is "Bioprocessing in Space — an Overview", pp. 557—571 in The World Biotech Report 1984, Volume 2: USA, 
(Online Publications, New York, N.Y. 1984). Recent studies have provided a basis for projection of efficacy 
of erythropoietin therapy in a variety of disease states, disorders and state of hematologic irregularity; 
Vedovato, et a\. r Acta. Haematol, 71, 211—213 (1984) (beta-thalassemia); Vichinsky, et al.,*/ Pediatr. 105(1), 
15—21 (1984) (cystic fibrosis); Cotes, et al., Brit. J. Obstet Gyneacol. 90(4), 304—311 (1983) (pregnancy. 

20 menstrual disorders); Haga, et al., Acta. Pediatr. Scand., 72, 827—831 (1983) (early anemia of prematurity); 
Claus-Walker, et ai., Arch Phys. Med. RehabiL, 65, 370—374 (1984) (spinal cord injury); Dunn, et al., Eur. J. 
Appl. Physiol., 52, 178—182 (1984), (space flight); Miller, et al., Brit J. Haematol., 52., 54&-590 (1982) 
(acute blood loss); Udupa, etal.,*/. Lab. Clin. Med., 103(4), 574—580 and 581—588 (1984); and Lipschitz,et 
al.. Blood, 63(3), 502—509 (1983) (aging); and Dainiak, et al.. Cancer, 51(6) t 1101—1106 (1983) and 

25 Schwartz, et al„ Otolaryngol., 109, 269—272 (1983) (various neoplastic disease states accompanied by 
abnormal erythropoiesis). 

Prior attempts to obtain erythropoietin in good yield from plasma or urine have proven relatively 
unsuccessful. Complicated and sophisticated laboratory techniques are necessary and generally result in 
the collection of very Small amounts of impure and unstable extracts containing erythropoietin. 

30 U.S. Letters Patent No. 3,033,753 describes a method for partially purifying erythropoietin from sheep 
blood plasma which provides low yields of a crude solid extract containing erythropoietin. 

Initial attempts to isolate erythropoietin from urine yielded unstable, biologically inactive preparations of 
the hormone. U.S. Letters Patent No. 3,865,801 describes a method of stabilizing the biological activity of a 
crude substance containing erythropoietin recovered from urine. The resulting crude preparation 

35 containing erythropoietin purportedly retains 90% of erythropoietin activity, and is stable. 

Another method of purifying human erythropoietin from urine of patients with aplastic anemia is 
described in Miyake, et al., J. Biol. Chem., Vol. 252, No. 15 (August 10, 1977), pp. 5558—5564. This seven- 
step procedure includes ion exchange chromatography, ethanol precipitation, gel filtration, and adsorption 
chromatography, and yields a pure erythropoietin preparation with a potency of 70,400 units/mg of protein 

40 in 21% yield. 

U.S. Letters Patent No. 4,397,840 to Takezawa, et al., describes methods of preparing "an 
erythropoietin product" from healthy human urine specimens with weakly basic ion exchangers and 
proposes that the low molecular weight products obtained "have no inhibitory effects against 
erythropoietin. 

45 U.K. Patent Application No. 2,085,887 by Sugimoto, et al., published May 6, 1982, describes a process 
for the production of hybrid human lymphoblastoid cells, reporting production levels ranging from 3 to 420 
Units of erythropoietin per ml of suspension of cells (distributed into the cultures after mammalian host 
propagation containing up to 10 7 cells per ml. At the highest production levels asserted to have been 
obtained, the rate of erythropoietin production could be calculated to be 40 to about 4,000 Units/10 6 cells/48 

so hours in in vitro culture following transfer of cells from in vivo propagation systems. (See also the 
equivalent U.S. Letters Patent No. 4,377,513.) Numerous proposals have been made for isolation of 
erythropoietin from tissue sources, including neoplastic cells, but the yields have been quite low. See, e.g., 
Jelkman, et al., Expt HematoL, 11(7), 581—588 (1983); Tambourin, et al., PMA.S. (USA), 80, 6269—6273 
(1983); Katsuoka, et al., Gann, 74, 534-541 (1983); Hagiwara, et at.. Blood, 63(4), 828-835 (1984); and 

55 Choppin, et al., Blood, 64(2), 341—347 (1984). 

Other isolation techniques utilized to obtain purified erythropoietin involve immunological procedures. 
A polyclonal, serum-derived antibody directed against erythropoietin is developed by injecting an animal, 
preferably a rat or rabbit, with human erythropoietin. The injected human erythropoietin is recognized as a 
foreign antigenic substance by the immune system of the animal and elicits production of antibodies 

so against the antigen. Differing cells responding to stimulation by the antigenic substance produce and 
release into circulation antibodies slightly different from those produced by other responding cells. The 
antibody activity remains in the serum of the animal when its blood is extracted. While un purified serum or 
antibody preparations purified as a serum immunoglobulin G fraction may then be used in asssays to 
detect and complex with human erythropoietin, the materials suffer from a major disadvantage. This 

55 serum antibody, composed of all the different antibodies produced by individual cells, is polyclonal in 
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nature and will complex with components in crude extracts other than erythropoietin alone. 

Of interest to the background of the present invention are recent advances in the art of developing 
continuous cultures of cells capable of producing a single species of antibody which is specifically 
immunologically reactive with a single antigenic determinant of a selected antigen. See, generally, 

5 Chisholm, High Technology, Vol. 3, No. 1, 57—63 (1983). Attempts have been made to employ cell fusion 
and hybridization techniques to develop "monoclonal" antibodies to erythropoietin and to employ these 
antibodies in the isolation and quantitative detection of human erythropoietin. As one example, a report of 
the successful development of mouse-mouse hybridoma cell lines secreting monoclonal antibodies to 
human erythropoietin appeared in abstract form in Lee-Huang, Abstract No. 1463 of Fed. Proc, 41, 520 

io (1982). As another example, a detailed description of the preparation and use of a monoclonal, anti- 
erythropoietin antibody appears in Weiss, et aL, PNA.S. (U.SA.), 79, 546&— 5469 11982). See also, Sasaki, 
Biomed. Biochim. Acta., 42(11/12), S202— S206 (1983); Yanagawa, et aL, Blood, 64(2), 357—364 (1984); 
Yanagawa, et ah, J. Biol. Chem., 259(6), 2707—2710 (1984); and U. S Letters Patent No. 4,465,624. 

Also of interest to the background of the invention are reports of the immunological activity of 

15 synthetic peptides which substantially duplicate the amino acid sequence extant in naturally-occurring 
proteins, glycoproteins and nucleo proteins. More specifically, relatively low molecular weight 
polypeptides have been shown to participate in immune reactions which are similar in duration and extent 
to the immune reactions of physiologically significant proteins such as viral antigens, polypeptide 
hormones, and the like. Included among the immune reactions of such polypeptides is the provocation of 

20 the formation of specific antibodies in immunologically active animals. See, e.g., Lerner, et al., Cell, 23, 
309-310 (1981 ); Ross, et al., Nature, 294, 654-656 (1981); Walter, et al., PMA.S. (USA.), 77, 5197—5200 
(1980); Lerner, et al., P.NA.S. (U.S.A.), 78, 3403-3407 (1981); Walter, et al., P.N.A.S. (U.S.A.), 78, 
4882—4886 (1981); Wong, et aL, P.NA.S. (U.SA), 78, 7412—7416 (1981); Green et aL, Cell, 28, 477—487 
(1982); Nigg, et aL, P.NA.S. (U.SA.), 79, 5322—5326 (1982); Baron, et aL, Cell, 28, 395-404 (1982); 

25 Dreesman, et aL, Nature, 295, 15&— 160 (1982); and Lerner, Scientific American, 248, No. 2, 66—74 (1983). 
See, also, Kaiser, et aL, Science, 223, pp. 24S-255 (1984) relating to biological and immunological activities 
of synthetic peptides which approximately share secondary structures of peptide hormones but may not 
share their primary structural conformation. The above studies relate, of course, to amino acid sequences 
of proteins other than erythropoietin, a substance for which no substantial amino acid sequence 

30 information has been published. In co-owned, co-pending U.S. Patent Application Serial No. 463,724, filed 
February 4, 1983, by J. Egrie, published August 22, 1984 as European Patent Application No. 0 116 446, 
there is described a mouse-mouse hybridoma cell line (A.T.C.C. No. HB8209) which produces a highly 
specific monoclonal, anti-erythropoietin antibody which is also specifically iummunoreactive with a 
polypeptide comprising the following sequence of amino acids: 

35 following sequence of amino acids: 

NH 2 -Ala-Pro-Pro-Arg-Leu-lle-Cys-Asp-Ser-Arg-Vai-Leu-6lu-Arg-Tyr-Leu-Leu-Glu-Ala-Lys-COOH. 

The polypeptide sequence is one assigned to the first twenty amino acid residues of mature human 

40- erythropoietin isolated according to the method of Miyake, et aL, J. Biol. Chem., 252, 5558—5564 (1 977) and 
upon which amino acid analysis was performed by the gas phase sequencer (Applied Biosystems, Inc.) 
according to the procedure of Hewick, M., et aL, J. Biol. Chem., 256, 7990—7997 (1 981 ). See, also. Sue, et aL, 
Proc. Nat. Acad. ScL (USA), 80, pp. 3651—3655 (1983) relating to development of polyclonal antibodies 
against a synthetic 26-mer based on a differing amino acid sequence, and Sytowski, et aL, J. Immunol. 

45 Methods, 69, pp. 181—186 (1984). 

While polyclonal and monoclonal antibodies as described above provide highly useful materials for 
use in immunoassays for detection and quantification of erythropoietin and can be useful in the affinity 
purification of erythropoietin, it appears unlikely that these materials can readily provide for the large scale 
isolation of quantities of erythropoietin from mammalian sources sufficient for further analaysis, clinical 

so testing and potential wide-ranging therapeutic use of the substance in treatment of, e.g., chronic kidney 
disease wherein diseased tissues fail to sustain production of erythropoietin. It is consequently projected in 
the art that the best prospects for fully characterizing mammalian erythropoietin and providing large 
quantities of it for potential diagnostic and clinical use involve successful application of recombinant 
procedures to effect large scale microbial synthesis of the compound. 

55 While substantial efforts appear to have been made in attempted isolation of DNA sequences coding 
for human and other mammalian species erythropoietin, none appear to have been successful. This is due 
principally to the scarcity of tissue sources, especially human tissue sources, enriched in mRNA such as 
would allow for construction of a cDNA library from which a DNA sequence coding for erythropoietin might 
be isolated by conventional techniques. Further, so little is known of the continuous sequence of amino 

60 acid residues of erythropoietin that it is not possible to construct, e.g., long polynucleotide probes readily 4 
capable of reliable use in DNA/DNA hybridization screening of cDNA and especially genomic DNA libraries. 
Illustratively, the twenty amino acid sequence employed to generate the above-named monoclonal 
antibody produced by A.T.C.C. No. HB8209 does not admit to the construction of an unambiguous, 60 base 
oligonucleotide probe in the manner described by Anderson, et aL, supra. It is estimated that the human 

6S gene for erythropoietin may appear as a "single copy gene" within the human genome and, in any event, 
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the genetic material coding for human erythropoietin is likely to constitute less than 0.00005% of total 
human genomic DNA which would be present in a genomic library. 

To date, the most successful of known reported attempts at recombinant-related methods to provide 
DNA sequences suitable for use in microbial expression of isolatable quantities of mammalian 
erythropoietin have fallen far short of the goal. As an example, Farber, et al. Exp. Hematoi, 11. Supp. 14, 
Abstract 101 (1983) report the extraction of mRNAfrom kidney tissues of phenylhydrazine-treated baboons 
and the injection of the mRNA into Xenopus laevis oocytes with the rather transitory result of in vitro 
production of a mixture of "translation products" which included among them displaying biological 
properties of erythropoietin. More recently, Farber, et al., Blood, 62, No. 5, Supp. No. 1, Abstract 392, at 
page 122a (1983) reported the in vitro translation of human kidney mRNA by frog ooctyes. The resultant 
translation product mixture was estimated to include on the order of 220 mU of a translation product 
having the activity of erythropoietin per microgram of injected mRNA. While such levels of in vitro 
translation of exogenous mRNA coding for erythropoietin were acknowledged to be quite low (compared 
even to the prior reported levels of baboon mRNA translation into the sought-for product) it was held that 
the results confirm the human kidney as a site of erythropoietin expression, allowing for the construction of 
an enriched human kidney cDNA library from which the desired gene might be isolated. [See also Farber 
din. Res., 31(4), 769A (1983).) 

Since the filing of U.S. Patent Application Serial Nos. 561,024 and 582,185, there has appeared a single 
report of the cloning and expression of what is asserted to have been human erythropoietin cDNA in £ coll 
Briefly put, a number of cDNA clones were inserted into £ coii plasmids and (3-lactamase fusion products 
were noted to be immunoreactive with a monoclonal antibody to an unspecified "epitope" of human 
erythropoietin. See, Lee-Huang, Proc. Nat. Acad. Sci. (USA), 81, pp. 2708—2712 (1984). 

Brief Summary 

The present invention provides, for the first time, novel purified and isolated polypeptide products 
having at least part of the primary structural conformation (i.e., continuous sequence of amino acid 
residues) of naturally-occurring erythropoietin, including allelic variants thereof. These polypeptides are 
also uniquely characterized by being the product of procaryotic or eucaryotic host expression (e.g., by 
bacterial, yeast and mammalian cells in culture) of exogenous DNA sequences obtained by genomic or 
cDNA cloning or by gene synthesis. Products of microbial expression in vertebrate (e.g., mammalian and 
avian) cells may be further characterized by freedom from association with human proteins or other 
contaminants which may be associated with erythropoietin in its natural mammalian cellular environment 
or in extracellular fluids such as plasma or urine. The products of typical yeast (e.g., Saccaromyces 
cerevisiae) or procaryote (e.g., Ecoii) host cells are free of association with any mammalian proteins. 
Depending upon the host employed, polypeptides of the invention may be glycosylated with mammalian 
or other eucaryotic carbohydrates or may be non-glycosylated. Polypeptides of the invention may also 
include an initial methionine amino acid residue (at position -1). 

Novel glycoprotein products of the invention include those having a primary structural conformation 
sufficiently duplicative of that of a naturally-occurring (e.g. human) erythropoietin to allow possession of 
biological properties thereof and having an average carbohydrate composition which differs from that of 
naturally-occurring (e.g., human) erythropoietin. 

Vertebrate (e.g., COS-1 and CHO) cells provided by the present invention comprise the first cells ever 
available which can be propagated in vitro continuously and which upon growth in culture are capable of 
producing in the medium of their growth in excess of 100U (preferably in excess of 500U and most 
preferably in excess of 1,000 to 5,000U) of erythropoietin per 10 6 cells in 48 hours as determined by 
radioimmunoassay. 

Also provided by the present invention are synthetic polypeptides wholly or partially duplicative of 
continuous sequences of erythropoietin amino acid residues which are herein for the first time elucidated. 
These sequences, by virtue of sharing primary, secondary or tertiary structural and conformational 
characteristics with naturally-occurring erythropoietin may possess biological activity and/or 
immunological properties in common with the naturally-occurring product such that they may be 
employed as biologically active or immunological substitutes for erythropoietin in therapeutic and 
immunological processes. Correspondingly provided are monoclonal and polyclonal antibodies generated 
by standard means which are immunoreactive with such polypeptides and, preferably, also 
immunoreactive with naturally-occurring erythropoietin. 

Illustrating the present invention are cloned DNA sequences of monkey and human species origins and 
polypeptide sequences suitably deduced therefrom which represent, respectively, the primary structural 
conformation of erythropoietins of monkey and human species origins. 

Also provided by the present invention are novel biologically functional viral and circular plasmid DNA 
vectors incorporating DNA sequences of the invention and microbial (e.g., bacterial, yeast and mammalian 
cell) host organisms stably transformed ortransfected with such vectors. Correspondingly provided by the 
invention are novel methods for the production of useful polypeptides comprising cultured growth of such 
transformed or transfected microbial hosts under conditions facilitative of large scale expression of the 
exogenous, vector-borne DNA sequences and isolation of the desired polypeptides from the growth 
medium, cellular lysates or cellular membrane fractions. 
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Isolation and purification of microbially expressed polypeptides provided by the invention may be by 
conventional means including. e.g. f preparative chromatographic separations and immunological 
separations involving monoclonal and/or polyclonal antibody preparations. 

Having herein elucidated the sequence of amino acid residues of erythropoietin, the present invention 

s provides for the total and/or partial manufacture of DNA sequences coding for erythropoietin and including 
such advantageous characteristics as incorporation of codons "preferred" for expression by selected non- 
mammalian hosts, provision of sites for cleavage by restriction endonuclease en2ymes and provision of 
additional initial, terminal or intermediate DNA sequences which facilitate construction of readily 
expressed vectors. Correspondingly, the present invention provides for manufacture (and development by 

w site specific mutagenesis of cDNA and genomic DNA) of DNA sequences coding for microbial expression of 
polypeptide analogs or derivatives of erythropoietin which differ from naturally-occurring forms in terms of 
the identity or location of one or more amino acid residues (i.e., deletion analogs containing less than all of 
the residues specified for EPO and/or substitution analogs wherein one or more residues specified are 
replaced by other residues and/or addition analogs wherein one or more amino acid residues is added to a 

15 terminal or medial portion of the polypeptide); and which share some or all the properties of naturally- 
occurring forms. 

Novel DNA sequences of the invention include all sequences useful in securing expression in 
procaryotic or eucaryotic host cells of polypeptide products having at least a part of the primary structural 
conformation and one or more of the biological properties of erythropoietin which are comprehended by: 

20 (a) the DNA sequences set out in Tables V and VI herein or their complementary strands; (b) DNA 
seqeuences which hybridize (under hybridization conditions such as illustrated herein or more stringent 
conditions) to DNA sequences defined in (a) or fragments thereof; and (c) DNA sequences which, but for 
the degeneracy of the genetic code, would hybridize to DNA sequences defined in (a) and (b) above. 
Specifically comprehended in part (b) are genomic DNA sequences encoding allelic variant forms of 

25 monkey and human erythropoietin and/or encoding other mammalian species of erythropoietin. 
Specifically comprehended by part (c) are manufactured DNA sequences encoding EPO, EPO fragments 
and EPO analogs which DNA sequences may incorporate codons facilitating translation of messenger RNA 
in non-vertebrate hosts. 

Comprehended by the present invention is that class of polypeptides coded for by portions of the DNA 
30 complement to the top strand human genomic DNA sequence of Table VI herein, i.e., "complementary 
inverted proteins" as described by Tramontano, et al., Nucleic Acids Research, 12, pp. 5049 — 5059 (1984). 

Also comprehended by the invention are pharmaceutical compositions comprising effective amounts 
of polypeptide products of the invention together with suitable pharmaceutically acceptable diluent(s), 
adjuvant(s) and/or carrier(s) which allow for provision of erythropoietin therapy, especially in the treatment 
35 of anemic disease states and most especially such anemic states as attend chronic renal failure. 

Polypeptide products of the invention may be "labelled" by covalent association with a detectable 
marker substance (e.g., radiolabeled with 12S I) to provide reagents useful in detection and quantification of 
erythropoietin in solid tissue and fluid samples such as blood or urine. DNA products of the invention may 
also be labelled with detectable markers (such as radiolabeis and non-isotopic labels such as biotin) and 
40 employed in DNA hybridization processes to locate the erythropoietin gene position and/or the position of 
any related gene family in the human, monkey and other mammalian species chromosomal map. They can 
also be used for identifying the erthropoietin gene disorders at the DNA level and used as gene markers for 
identifying neighboring genes and their disorders. 

As hereinafter described in detail, the present invention further provides significant improvements in 
45 methods for detection of a specific single stranded polynucleotide of unknown sequence in a 
heterogeneous cellular or viral sample including multiple single-stranded polynucleotides where 

(a) a mixture of labelled single-stranded polynucleotide probes is prepared having uniformly varying 
sequences of bases, each of said probes being potentially specifically complementary to a sequence of 
bases which is putatively unique to the polynucleotide to be detected, 
so (b) the sample is fixed to a solid substrate, 

(c) the substrate having the sample fixed thereto is treated to diminish further binding of 
polynucleotides thereto except by way of hybridization to polynucleotides in said sample, 

(d) the treated substrate having the sample fixed thereto is transitorily contacted with said mixture of 
labelled probes under conditions facilitative of hybridization only between totally complementary 

55 polynucleotides, and 

(e) the specific polynucleotide is detected by monitoring for the presence of a hybridization reaction 
between it and a totally complementary probe within said mixture of labelled probes, as evidenced by the 
presence of a higher density of labelled material on the substrate at the locus of the specific polynucleotide 
in comparison to a background density of labelled material resulting from non-specific binding of labelled 

eo probes to the substrate. 

The procedures are especially effective in situations dictating use of 64, 128, 256, 512, 1024 or more 
mixed polynucleotide probes having, a length of 17 to 20 bases in DNA/DNA or RNA/RNA or DNA/RNA 
hybridizations. 

As described infra, the above-noted improved procedures have illustratively allowed for the 
6s identification of cDNA clones coding for erythropoietin of monkey species origins within a library prepared 
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from anemic monkey kidney cell mRNA. More specifically, a mixture of 128 uniformly varying 20-mer 
probes based on amino acid sequence information derived from sequencing fractions of human 
erythropoietin was employed in colony hybridization procedures to identify seven "positive" 
erythropoietin cDNA clones within a total of 200,000 colonies. Even more remarkably, practice of the 
improved procedures of the invention have allowed for the rapid isolation of three positive clones from 
within a screening of 1,500,000 phage plaques constituting a human genomic library. This was 
accomplished through use of the above-noted mixture of 128 20-mer probes together with a second set of 
128 17-mer probes based on amino acid analysis of a different continuous sequence of human 
erythropoietin. 

The above-noted illustrative procedures constitute the first known instance of the use of multiple 
mixed oligonucleotide probes in DNA/DNA hybridization processes directed toward isolation of 
mammalian genomic clones and the first known instance of the use of a mixture of more than 32 
oligonucleotide probes in the isolation of cDNA clones. 

Numerous aspects and advantages of the invention will be apparent to those skilled in the art upon 
consideration of the following detailed description which provides illustrations of the practice of the 
invention in its presently preferred embodiments. 



25 
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Detailed Description 

According to the present invention, DNA sequences encoding part or all of the polypeptide sequence of 
20 human and monkey species erythropoietin (hereinafter, at times, "EPO") have been isolated and 
characterized. Further, the monkey and human origin DNA has been made the subject of eucaryotic and 
procaryotic expression providing isolatable quantities of polypeptides displaying biological (e.g., 
immunological) properties of naturally-occurring EPO as well as both in vivo and in vitro biological 
activities of EPO. 

The DNA of monkey species origins was isolated from a cDNA library constructed with mRNA derived 
from kidney tissue of a monkey in a chemically induced anemic state and whose serum was 
immunologically determined to include high levels of EPO compared to normal monkey serum. The 
isolation of the desired cDNA clones containing EPO encoding DNA was accomplished through use of 
DNA/DNA colony hybridization employing a pool of 128 mixed, radiolabeled, 20-mer oligonucleotide 
probes and involved the rapid screening of 200,000 colonies. Design of the oligonucleotide probes was 
based on amino acid sequence information provided by enzymatic fragmentation and sequencing a small 
sample of human EPO. 

The DNA of human species origins was isolated from a human genomic DNA library. The isolation of 
clones containing EPO-encoding DNA was accomplished through DNA/DNA plaque hybridization 
employing the above-noted pool of 128 mixed 20-mer oligonucleotide probes and a second pool of 128 
radiolabeled 17-mer probes whose sequences were based on amino acids sequence information obtained 
from a different enzymatic human EPO fragment. 

Positive colonies and plaques were verified by means of dideoxy sequencing of clonal DNA using a 
subset of 16 sequences within the pool of 20-mer probes and selected clones were subjected to nulceotide 
sequence analysis resulting in deduction of primary structural conformation of the EPO polypeptides 
encoded thereby. The deduced polypeptide sequences displayed a high degree of homology to each other 
and to a partial sequence generated by amino acid analysis of human EPO fragments. 

A selected positive monkey cDNA clone and a selected positive human genomic clone were each 
inserted in a "shuttle" DNA vector which was amplified in Ecoii and employed to transfect mammalian 
45 cells in culture. Cultured growth of transfected host cells resulted in culture medium supernatant 
preparations estimated to contain as much as 3000 mU of EPO per ml of culture fluid. 

The following examples are presented by way of illustration of the invention and are specifically 
directed to procedures carried out prior to identification of EPO encoding monkey cDNA clones and human 
genomic clones, to procedures resulting in such identification, and to the sequencing, development of 
>o expression systems and immunological verification of EPO expression in such systems. 

More particularly, Example 1 is directed to amino acid sequencing of human EPO fragments and 
construction of mixtures of radiolabeled probes based on the results of this sequencing. Example 2 is 
generally directed to procedures involved in the identification of positive monkey cDNA clones and thus 
provides information concerning animal treatment and preliminary radioimmunoassay (RIA) analysis of 
►5 animal sera. Example 3 is directed to the preparation of the cDNA library, colony hybridization screening 
and verification of positive clones, DNA sequencing of a positive cDNA clone and the generation of monkey 
EPO polypeptides primary structural conformation (amino acid sequence) information. Example 4 is 
directed to procedures involved in the identification of positive human genomic clones and thus provides 
information concerning the source of the genomic library, plaque hybridization procedures and verification 
'O of positive clones. Example 5 is directed to DNA sequencing of a positive genomic clone and the generation 
of human EPO polypeptide amino acid sequence information including a comparison thereof to the 
monkey EPO sequence information. Example 6 is directed to procedures for construction of a vector 
incorporating EPO-encoding DNA derived from a positive monkey cDNA clone, the use of the vector for 
transaction of COS-1 cells and cultured growth of the transfected cells. Example 7 is directed to procedures 
s for construction of a vector incorporating EPO-encoding DNA derived from a positive human genomic 
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clone, the use of the vector for transfection of COS-1 cells and the cultured growth of the transfected cells. 
Example 8 is directed to immunoassay procedures performed on media supernatants obtained from the 
cultured growth of transfected cells according to Example 6 and 7. Example 9 is directed to in vitro and in 
vivo biological activity of microbially expressed EPO of Examples 6 and 7. 

s Example 10 is directed to a development of mammalian host expression systems for monkey species 

EPO cDNA and human species genomic DNA involving Chinese hamster ovary {"CHO") cells and to the 
immunological and biological activities of products of these expression systems as well as characterization 
of such products. Example 11 is directed to the preparation of manufactured genes encoding human 
species EPO and EPO analogs, which genes include a number of preference condons for expression in £ 

w coll and yeast host cells, and to expression systems based thereon. Example 12 relates to the 
immunological and biological activity profiles of expression products of the systems of Example II. 

Example 1 

A, Human EPO Fragment Amino Acid Sequencing 
;5 Human EPO was isolated from urine and subjected to tryptic digestion resulting in the development 

and isolation of 17 discrete fragments in quantities approximating 100—150 picomoles. 

Fragments were arbitrarily assigned numbers and were analyzed for amino acid sequence by 

microsequence analysis using a gas phase sequencer {Applied Biosystems) to provide the sequence 

information set out in Table I, below, wherein single letter codes are employed and "X" designates a 
20 residue which was not unambiguously determined. 

TABLE I 



Fragment No. Sequence Analysis Result 

A-P-P-R 
G-K-L-K 
A-L-G-A-Q-K 
V-L-E-R 
A-V-S-G-L-R 
L-F-R 
K-L-F-R 
Y-L-L-E-A-K 
L-I-C-D-S-R 
L-Y-T-G-E-A-C-R 
T-I-T-A-0-T-F-R 
E-A-I-S-P-P-D-A-A-M-A-A-P-L-R 
E-A-E-X-I-T-T-G-X-A-E-H-X-S-L- 

N-E-X-I-T-V-P 
V-Y-S-N-F-L-R 
S-L-T-T-L-L-R 
V-N-F-Y-A-W-K 
G-Q-A-L-L-V-X-S-S-Q-P-W- 
E-P-L-Q-L-H-V-D-K 



30 



35 



40 



45 



T4a 

T4b 

T9 

T13 

T16 

T18 

T21 

T25* 

T26a 

T26b 

T27 

T28 

T30 



50 



55 



T31 
T33 
T35 
T38 



60 B. Design and Construction of Oligonucleotide Probe Mixtures 

The amino acid sequences set out in Table I were reviewed in the context of the degeneracy of the 
genetic code for the purpose of ascertaining whether mixed probe procedures could be applied to DNA/ 
DNA hybridization procedures on cDNA and/or genomic DNA libraries. This analysis revealed that within 
Fragment No. T35 there existed a series of 7 amino acid residues (Val-Asn-Phe-Tyr-Ala-Trp-Lys) which 

65 could be uniquely characterized as encoded for by one of 128 possible DNA sequences spanning 20 base 
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pairs. A first set of 128 20-mer oligonucleotides was therefore synthesized by standard phosphoamidite 
methods (See, e.g., Beaucage, et al., Tetrahedron Letters, 22, pp. 1859—1862 (1981) on a solid support 
according to the sequence set out in Table II, below. 

TABLE II 





Residue 


- Val 


- Asn 


Phe 


Tyr 


Ala 


TT£ 


Lys 


10 


3' 


CAA 
T 
G 
C 


TTG 
A 


AAG 
A 


ATG 
A 


CGA 
T 
G 
C. 


ACC 


TT - 5' 



15 

Further analysis revealed that within fragment No. T38 there existed a series of 6 amino acid residues 
(Gln-Pro-Trp-Glu-Pro-Leu) on the basis of which there could be prepared a pool of 128 mixed 
oligonucleotide 17-mer probes as set out in Table III below. 



20 TABLE III 



Residue 


- Gin 


Pro 


He 


Glu 


Pro 


Leu 


3' 


GTT 
C 


GGA 
T 
G 
C 


ACC 


CTT 
C 


GGA 
T 

G 
C 


GA 
A 



Oligonucleotide probes were labelled at the 5' end with gamma — 32 P — ATP, 7500—8000 Ci/mmole 
(ICN) using T 4 polynucleotide kinase (NEN). 



Example 2 

A. Monkey Treatment Procedures and RIA Analysis 

35 Female Cynomolgus monkeys Macaca fascicularias (2.5—3 kg, 1.5—2 years old) were treated 
subcutaneously with a pH 7.0 solution of pheny (hydrazine hydrochloride at a dosage level of 12.5 mg/kg on 
days 1, 3 and 5. The hematocrit was monitored prior to each injection. On day 7, or whenever the 
hematocrit level fell below 25% of the initial level, serum and kidneys were harvested after administration 
of 25 mg/kg doses of ketamine hydrochloride. Harvested materials were immediately frozen in liauid 

40 nitrogen and stored at -70°C. 



45 



so 



55 



60 



B. RIA for EPO 

Radioimmunoassay procedures applied for quantitative detection of EPO in samples were conducted 
according to the following procedures: 

An erythropoietin standard or unknown sample was incubated together with antiserum for two hours 
at 37°C. After the two hour incubation, the sample tubes were cooled on ice, 125 l-la belled erythropoietin was 
added, and the tubes were incubated at 0°C for at least 1 5 more hours. Each assay tube contained 500 ul of 
incubation mixture consisting of 50 Ml of diluted immune sera, 10,000 cpm of ^-erythropoietin 5 ul 
trasylol and 0—250 ul of either EPO standard or unknown sample, with PBS containing 0.1% BSA making 
up the remaining volume. The antiserum used was the second test bleed of a rabbit immunized with a 1 % 
pure preparation of human urinary erythropoietin. The final antiserum dilution on the assay was adjusted 
so that the antibody-bound 125 l-EPO did not exceed 10—20% of the input total counts. In general, this 
corresponded to a final antiserum dilution of from 1:50,000 to 1:100,000. 

The antibody-bound ^-erythropoietin was precipitated by the addition of 150 Ml Staph A. After a 40 
mm. incubation, the samples were centrifuged and the pellets were washed two times with 0.75 ml 10 mM 
Tns-HCI pH 8.2 containing 0.15M NaCI, 2mM EDTA, and 0.05% Triton X-100. The washed pellets were 
counted in a gamma counter to determine the percent of ^-erythropoietin bound. Counts bound by pre- 
immune sera were subtracted from ail final values to correct for nonspecific precipitation. The 
erythropoietin content of the unknown samples was determined by comparison to the standard curve. 

The above procedure was applied to monkey serum obtained in Part A, above, as well as to the 
untreated monkey serum. Normal serum levels were assayed to contain approximately 36 mU/ml while 
treated monkey serum contained from 1000 to 1700 mU/ml. 
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Example 3 

A. Monkey cDNA Library Construction 

Messenger RNA was isolated from normal and anemic monkey kidneys by the guanidinium 
thiocyanate procedure of Chirgwin, et al.. Biochemistry, 18, p. 5294 (1979) and poly (A) + mRNA was purified 

5 by two runs of oligo(dT)-ceullose column chromatography, as described at pp. 197 — 198 in Maniatis, et al., 
"Molecular Cloning, A Laboratory Manual" (Cold Springs Harbor Laboratory, Cold Springs, Harbor, N.Y., 
1982). The cDNA library was constructed according to a modification of the general procedures of 
Okayama, et al., MoL and Ceil. BioL, 2, pp. 161—170 (1982). The key features of the presently preferred 
procedures were as follows: (1) pUC8 was used as the sole vector, cut with Pst\ and then tailed with oligo 

io cfT of 60—80 bases in length; (2) HincW digestion was used to remove the oligo dT tail from one end of the 
vector; (3) first strand synthesis and oligo dG tailing was carried out according to the published procedure; 
(4) BamH\ digestion was employed to remove the oligo dG tail from one end of the vector; and (5) 
replacement of the RNA strand by DNA was in the presence of two linkers 

75 (GATCTAAAGACCGTCCCCCCCCC and ACGGTCTTTA) 

in a three-fold molar excess over the oligo dG tailed vector. 

B. Colony Hybridization Procedures For Screening Monkey cDNA Library 

20 Transformed £ coii were spread out at a density of 9000 colonies per 10 x 10 cm plate on nutrient 
plates containing 50 micrograms/ml Ampicillin. GeneScreen filters (New England Nuclear Catalog No. NEF- 
972) were pre-wet on a BHI — CAM plate (Bacto brain heart infusion 37 g/L, Casamino acids 2 g/L and agar 
15 g/L, containing 500 micrograms/ml Chloramphenicol) and were used to lift the colonies off the plate. The 
colonies were grown in the same medium for 12 hours or longer to amplify the plasmid copy numbers. The 

25 amplified colonies (colony side up) were treated by serially placing the filters over 2 pieces of Whatman 
3 MM paper saturated with each of the following solutions: 

(1) 50 mM glucose— 25 mM Tris-HCI (ph 8.0)— 10 mM EDTA (pH 8.0) for five minutes; 

(2) 0.5 M NaOH for ten minutes; and 

(3) 1.0 M Tris-HCI (pH 7.5) for three minutes. 

^0 The filters were then air dried in a vacuum over at BOX for two hours. 

The filters were then subjected to Proteinase K digestion through treatment with a solution containing 
50 micrograms/ml of the protease enzyme in Buffer K [0.1M Tris-HCI (pH 8.0) — 0.15M NaCI — 10 mM EDTA 
(pH 8.2) — 0.2% SDS]. Specifically, 5 ml of the solution was added to each filter and the digestion was 
allowed to proceed at 55°C for 30 minutes, after which the solution was removed. 
35 The filters were then treated wtih 4 ml of a prehybridization buffer (5 x SSPE — 0.5% SDS— 100 
microgramms/ml SS £ coii DNA— 5 x BFP). The prehybridization treatment was carried out at 55°C, 
generally for 4 hours or longer, after which the prehybridization buffer was removed. 

The hybridization process was carried out in the following manner. To each filter was added 3 ml of 
hybridization buffer (5 x SSPE — 0.5% SDS— 100 micrograms/ml yeast tRNA) containing 0.025 picomoles of 
40 each of the 128 probe sequences of Table II (the total mixture being designated the EPV mixture) and the 
filters were maintained at 48°C for 20 hours. This temperature was 2°C less than the lowest of the calculated 
dissociation temperatures (Td) determined for any of the probes. 

Following hybridization, the filters were washed three times for ten minutes on a shaker with 6 x 
SSC— 0.1% SDS at room temperature and washed two to three times with 6 x SSC— 1% SDS at the 
45 hybridization temperature (48°C). 

Autoradiography of the filters revealed seven positive clones among the 200,000 colonies screened. 
Initial sequence analysis of one of the putative monkey cDNA clones (designated clone 83) was 
performed for verification purposes by a modification of the procedure of Wallace, et al.. Gene, 16, pp. 
21—26 (1981). Briefly, plasmid DNA from monkey cDNA clone 83 was linearized by digestion with fcoRI 
so and denatured by heating in a boiling water bath. The nucleotide sequence was determined by the dideoxy 
method of Sanger, et a!., P.NA.S. (U.S.A.) 74, pp. 5463—5467 (1977). A subset of the EPV mixture of probes 
consisting of 16 sequences was used as a primer for the sequencing reactions. 

C. Monkey EPO cDNA Sequencing 

55 Nucleotide sequence analysis of clone 83 was carried out by the procedures of Messing, Methods in 
Enzymofogy, 101, pp. 20—78 (1983). Set out in Table IV is a preliminary restriction map analysis of the 
approximately 1600 base pair EcoR\!Hind\\\ cloned fragment of clone 83. Approximate locations of 
restriction endo-nuclease enzyme recognition sites are provided in terms of number of bases 3' to the 
EcoR\ site at the 5' end of the fragment. Nucleotide sequencing was carried out by sequencing individual 

60 restriction fragments with the intent of matching overlapping fragments. For example, an overlap of 
sequence information provided by analysis of nucleotides in a restriction fragment designated C113 
{SauZA at -111/S/nal at —324) and the reverse order sequencing of a fragment designated C73 iAM at 
-~424tBstE)\ at -203). 

$5^ 
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TABLE IV 

Restriction Enzyme 
Recognition Site Approximate Location(s) 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



EcoRI 


1 


Sau3A 


111 


Smal 


• 180 


BstEII 


203 


Smal 


324 


Kpnl 


371 


Rsal 


372 


Alul 


424 


PstI 


426 


Alul 


430 


Hpal 


466 


Alul 


546 


PstI 


601 


PvuII 


604 


Alul 


605 


Alul 


782 


Alul 


788 


Rsal 


792 


PstI 


807 


Alul 


841 


Alul 


927 


Ncol 


946 


Sau3A 




Alul 


1072 


Alul 


1115 


Alul 


1223 


PstI 


1301 


Rsal 


1343 


Alul 


1384 


Hindll I 


1449 


Alul 


1450 


Hindlll 


1585 



Sequencing of approximately 1342 base pairs (within the region spanning the Saw3A site 3' to the fcorl 
site and the Hind\\\ site) and analysis of ail possible reading frames has allowed for the development of 

60 DNA and amino acid sequence information set out in Table V. In the Table, the putative initial amino acid 
residue of the amino terminal of mature EPO (as verified by correlation to the previously mentioned 
sequence analysis of twenty amino terminal residues) is designated by the numeral +1.The presence of a 
methionine-specifying ATG codon (designated -27) "upstream") of the initial amino terminal alanine 
residue as the first residue designated for the amino acid sequence of the mature protein is indicative of the 

65 likelihood that EPO is initially expressed in the cytoplasm in a precursor form including a 27 amino acid 
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"leader" region which is excised prior to entry of mature EPO into circulation. Potential glycosylation sites 
within the polypeptide are designated by asterisks. The estimated molecular weight of the translated 
region was determined to be 21,117 daltons and the M.W. of the 165 residues of the polypeptide 
constituting mature monkey EPO was determined to be 18,236 daltons. 

TABLE V 



Translation of Monkey EPO cDNA 



flffCCGCGCCCCCTGGACAGCCGCCCTCTCCTCCAGGCCCGTGGGGCTGGCCCTGCCC 
CGCTGAACTTCCCGGGATGAGGACTCCCGGTGTGGTCACCGCGCGCCTAGGTCGCTGAG 

-27 -20 
Met Gly Val His Glu Cys Pro Ala Trp 
GGACCCCGGCCAGGCGCGGAGATG GGG GTG CAC GAA TGT CCT GCC TGG 

-10 

Leu Trp Leu Leu Leu Ser Leu Val Ser Leu Pro Leu Gly Leu Pro 
CTG TGG CTT CTC CTG TCT CTC GTG TCG CTC CCT CTG GGC CTC CCA 

-1 +1 10 
Val Pro Gly Ala Pro Pro Arg Leu He Cys Asp Ser Arg Val Leu 
GTC CCG GGC GCC CCA CCA CGC CTC ATC TGT GAC AGC CGA GTC CTG 

20 * 
Glu Arg Tyr Leu Leu Glu Ala Lys Glu Ala Glu Asn Val Thr Met 
GAG AGG TAC CTC TTG GAG GCC AAG GAG GCC GAG AAT GTC ACG ATG 

30 * AO 

Gly Cys Ser Glu Ser Cys Ser Leu Asn Glu Asn He Thr Val Pro 
GGC TGT TCC GAA AGC TGC AGC TTG AAT GAG AAT ATC ACC GTC CCA 



50' 

Asp Thr Lys Val Asn Phe Tyr Ala Trp Lys Arg Met Glu Val Gly 
GAC ACC AAA GTT AAC TTC TAT GCC TGG AAG AGG ATG GAG GTC GGG 

60 70 
Gin Gin Ala Val Glu Val Trp Gin Gly Leu Ala Leu Leu Ser Glu 
CAG CAG GCT GTA GAA QTC TGG CAG GGC CTG GCC CTG CTC TCA GAA 

80 * 

Ala Val Leu Arg Gly Gin Ala Val Leu Ala Asn Ser Ser Gin Pro 
GCT GTC CTG CGG GGC CAG GCC GTG TTG GCC AAC TCT TCC CAG CCT 

90 100 
Phe Glu Pro Leu Gin Leu His Met Asp Lys Ala He Ser Gly Leu 
TTC GAG CCC CTG CAG CTG CAC ATG GAT AAA GCC ATC AGT GGC CTT 

110 

Arg Ser He Thr Thr Leu Leu Arg Ala Leu Gly Ala Gin Glu Ala 
CGC AGC ATC ACC ACT CTG CTT CGG GCG CTG GGA GCC CAG GAA GCC 
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TABLE V (continuted) 

120 130 
5 lie Ser Leu Pro Asp Ala Ala Ser Ala Ala Pro Leu Arq Thr He 
ATC TCC CTC CCA GAT GCG GCC TCG GCT GCT CCA CTC CGA ACC ATC • 

140 

Thr Ala Asp Thr Phe Cys Lys Leu Phe Arg Val Tyr Ser Asn Phe 
ACT GCT GAC ACT TTC TGC AAA CTC TTC CGA GTC TAC TCC AAT TTC 

150 160 
Leu Arg Gly Lys Leu Lys Leu Tyr Thr Gly Glu Ala Cys Arg Arg 
1S CTC CGG GGA AAG CTG AAG CTG TAC ACG GGG GAG GCC TGC AGG AGA 

165 

Gly Asp Arg OP 

GGG GAC AGA TGA CCAGGTGCGTCCAGCTGGGCACATCCACCACCTCCCTCACCAACA 



20 



30 



CTGCCTGTGCCACACCCTCCCTCACCACTCCCGAACCCCATCGAGGGGCTCTCAGCTAAG 

2s CGCCAGCCTGTCCCATGGACACTCCAGTGCCAGCAATGACATCTCAGGGGCCAGAGGAAC 
TGTCCAGAGCACAACTCTGAGATCTAAGGATGTCGCAGGGCCAACTTGAGGGCCCAGAGC 
AGGAAGCATTCAGAGAGCAGCTTTAAACTCAGGAGCAGAGACAATGCAGGGAAAACACCT 
GAGCTCACTCGGCCACCTGCAAAATTTGATGCAGGACACGCTTTGGAGGCAATTTACCTG 
TTTTTGCACCTACCATCAGGGACAGGATGACTGGAGAACTTAGGTGGCAAGCTGTGACTT 
CTCAAGGCCTCACGGGCACTCCCTTGGTGGCAAGAGCCCCCTTGACACTGAGAGAATATT 
TTGCAATCTGCAGCAGGAAAAATTACGGACAGGTTTTGGAGGTTGGAGGGTACTTGACAG 
to GTGTGTGGGGAAGCAGGGCGGTAGGGGTGGAGCTGGGATGCGAGTGAGAACCGTGAAGAC 

AGGATGGGGGCTGGCCTCTGGTTCTCGTGGGGTCCAAGCTT 

HindHI 



35 



45 



h„H™lr VPe ^ se «l ue "«* ^ Table V may readily be subjected to analysis for the presence of highly 
E££ • re9 ' 0nS 8 u d/0r secondarv conformational characteristics indicative of potentially high J 
immunogenic regions by, e.g.. the methods of Hopp. et al.. P.NJ\.S. (USA.). 78. pp. 3824-3828 (1981 and 

so P ° IB ' OL ; 757 'PP- 105 - 132 ( 1 982) and/or Chou, etal.fi/ocW, /fpp. 222-245 974 and 

method k IStTk 9Y ' PP', 4 *- 47 < 1978 '- Computer-assisted analysis according to the Hopp. et aL 

KSSn^S^S.T?"" ° f \ Pr ° gra » desi 9" ated PEP R^nce Section 6.7 made available by 
Intelligenetics. Inc., 124 University Avenue, Palo Alto, California. 

. , , Example 4 

55 A. Human Genomic Library 

. . A £" 4 A Phage-borne human fetal liver genomic library prepared according to the procedures of Lawn 
et ai., cell, 18, pp. 533-643 (1979) was obtained and maintained for use in a plaque hybridization assay.' 

B. Plaque Hybridization Procedures For Screening Human Genomic Library 

60 the nr ?Z« S *w™ ^l"^ r* ° NAs Were fixed on fil,ers (S0 ' 000 ^ ues P er filter > acc °^ing to 
SuSS <n™ %^ 0, «H hO , d5 'I En ? mo °9*- 68 > PP- 3 89 ~395 (1979) except for the use of GeneScrlen 
Plus filters (New England Nuclear Catalog No. NEF-976) and NZYAM plates (NaCI, 5 g; MgCI,-6H 2 0, 2 g; 
NZ-Amine A, 10 g; yeast extract, 5 g; casamino acids, 2 g; maltose; 2 g; and agar, 15 g per liter) 

« c , o r » " e i ^ u Were baked at 80 ° C for 1 hour and *"> digested with Proteinase K as described in 
ss Example 3, Part B. Prehybndization was carried out with a 1 M NaCI — 1 % SDS buffer for 55°C for 4 hours or 
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more, after which the buffer was removed. Hybridization and post-hybridization washings were carried out 
as described in Example 3, Part B. Both the mixture of 128 20-mer probes designated EPV and the mixture 
of 128 17-mer probes of Table III (designated the EPQ mixture) were employed. Hybridization was carried 
out at 48°C using the EPV probe mixture. EPQ probe mixture hybridization was carried out at 46°C — 4 

5 degrees below the lowest calculated Td for members of the mixture. Removal of the hybridized probe for 
rehybridization was accomplished by boiling with 1 x SSC — 0.1% SDS for two minutes. Autoradiography 
of the filters revealed three positive clones (reactive with both probe mixtures) among the 1,500,000 phage 
plaques screened. Verification of the positive clones as being EPO-encoding was obtained through DNA 
sequencing and electron micrographic visualization of heteroduplex formation with the monkey cDNA of 

10 Example 3. This procedure also gave evidence of multiple introns in the genomic DNA sequence. 

Example 5 

Nucleotide sequence analysis of one of the positive clones (designated AhE1) was carried out and 
results obtained to date are set out in Table VI. 

15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65. 
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In Table VI, the initial continuous DNA sequence designates a top strand of 620 bases in what is 
apparently an untranslated sequence immediately preceding a translated portion of the human EPO gene. 
More specifically, the sequence appears to comprise the 5' end of the gene which leads up to a translated 
DNA region coding for the first four amino acids (-27 through -24) of a leader sequence ("presequence"). 

5 Four base pairs in the sequence prior to that encoding the beginning of the leader have not yet been 
unambiguously determined and are therefore designated by an "X". There then follows an intron of about 
639 base pairs (439 base pairs of which have been sequenced and the remaining 200 base pairs of which 
are designated "I.S.") and immediately preceding a codon for glutamine which has been designated as 
residue -23 of the translated polypeptide. The exon sequence immediately following is seen to code for 

io amino acid residues through an alanine residue (designated as the +1 residue of the amino acid sequence 
of mature human EPO) to the codon specifying threonine at position +26, whereupon there follows a 
second intron consisting of 256 bases as specifically designated. Following this intron is an exon sequence 
for amino acid residues 27 through 55 and thereafter a third intron comprising 612 base pairs commences. 
The subsequent exon codes for residues 56 through 1 15 of human EPO and there then commences a fourth 

is intron of 134 bases as specified. Following the fourth intron is an exon coding for residue Nos. 116 through 
166 and a "stop" codon (TGA). Finally, Table VI identifies a sequence of 568 base pairs in what appears to 
be an untranslated 3' region of the human EPO gene, two base pairs of which ("X") have not yet been 
unambiguously sequenced. 

Table VI thus serves to identify the primary structural conformation (amino acid sequence) of mature 

20 human EPO as including 166 specified amino acid residues (estimated M.W. = 18,399). Also revealed in the 
Table is the DNA sequence coding for a 27 residue leader sequence along with 5' and 3' DNA sequences 
which may be significant to promoter/operator functions of the human gene operon. Sites for potential 
glycosylation of the mature human EPO polypeptide are designated in the Table by asterisks. It is worthy of 
note that the specific amino acid sequence of Table VI likely constitutes that of a naturally occurring allelic 

25 form of human erythropoietin. Support for this position is found in the results of continued efforts at 
sequencing of urinary isolates of human erythropoietin which provided the finding that a significant 
number of erythropoietin molecules therein have a methionine at residue 126 as opposed to a serine as 
shown in the Table. 

Table VII, below, illustrates the extent of polypeptide sequence homology between human and 
30 monkey EPO. In the upper continuous line of the Table, single letter designations are employed to 
represent the deduced translated polypeptide sequences of human EPO commencing with residue -27 and 
the lower continuous line shows the deduced polypeptide sequence of monkey EPO commencing at 
assigned residue number -27. Asterisks are employed to highlight the sequence homologies. It should be 
noted that the deduced human and monkey EPO sequences reveal an "additional" lysine (K) residue at 
35 (human) position 116. Cross-reference to Table VI indicates that this residue is at the margin of a putative 
mRNA splice junction in the genomic sequence. Presence of the lysine residue in the human polypeptide 
sequence was further verified by sequencing of a cDNA human sequence clone prepared from mRNA 
isolated from COS — 1 cells transfored with the human genomic DNA in Example 7, infra. 
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Example 6 

The expression system selected for initial attempts at microbial synthesis of isolatable quantities of 
EPO polypeptide material coded for by the monkey cDNA provided by the procedures of Example 3 was 
one involving mammalian host cells (i.e., COS— 1 cells, AT.C.C. No. CRL — 1650). The cells were transfected 
with a "shuttle" vector capable of autonomous replication in Ecoff host (by virtue of the presence of 
pBR322-derived DNA) and the mammalian hosts (by virtue of the presence of SV40 virus-derived DNA). 

More specifically, an expression vector was constructed according to the following procedures. The 
plasmid clone 83 provided in Example 3 was amplified in E.coli and the approximately 1 .4 kb monkey EPO 
encoding DNA was isolated by £coRI and Hind\\\ digestion. Separately isolated was an approximately 4 0 
kb, Hmd\\\iSaf\ fragment from pBR322. An approximately 30 bp, EcoR\/SaI\ "linker" fragment was obtained 
from M13mp10 RF DNA (P and L Laboratories). This linker included, in series, an EcoRI sticky end, followed 
by Sst\, Sma\, BamH\ and Xba\ recognition sites and a Sal\ sticky end. The above three fragments were 
ligated to provide an approximately 5.4 kb intermediate plasmid ("pERS") wherein the EPO DNA was 
flanked on one side by a "bank" of useful restriction endonuclease recognition sites. pERS was then 
digested with HinM and Sail to yield the EPO DNA and the fcoRI to Sa/I (M13mp10) linker. The 1.4 kb 
fragment was ligated with an approximately 4.0 kb BamHVSah of pBR322 and another M13mp10 HindUU 
BamH\ RF fragment linker also having approximately 30 bp. The M13 linker fragment was characterized by 
a Hmd\\\ sticky end, followed by Psti, Sa/I, Xba\ recognition sites and a Bam\ sticky end. The ligation product 
was, again, a useful intermediate plasmid ("pBR — EPO") including the EPO DNA flanked on both sides by 
banks of restriction site. 

The vector chosen for expression of the EPO DNA in COS — 1 cells ("pDSVLI") had previously been 
constructed to allow for selection and autonomous replication in ExolL These characteristics are provided 
by the origin of replication and Ampicillin resistance gene DNA sequences present in the region spanning 
nucleotides 2448 through 4362 of pBR322. This sequence was structurally modified by the addition of a 
linker providing a Hind\\\ recognition immediately adjacent nucleotide 2448 prior to incorporation into the 
vector. Among the selected vector's other useful properties was the capacity to autonomously replicate in 
COS— 1 cells and the presence of a viral promoter sequence functional in mammalian cells. These 
characteristics are provided by the origin of replication DNA sequence and "late gene" viral promoter DNA 
sequence present in the 342 bp sequence spanning nucleotide numbers 5171 through 270 of the SV40 
genome. A unique restriction site (BamH\) was provided in the vector and immediately adjacent the viral 
promoter sequence through use of a commercially available linker sequence (Collaborative Research). Also 
incorporated in the vector was a 237 base pair sequence (derived as nucleotide numbers 2553 through 2770 
of SV40) containing the "late gene" viral mRNA polyadenylation signal (commonly referred to as a 
transcription terminator). This fragment was positioned in the vector in the proper orientation vis-a-vis the 
"late gene" viral promoter via the unique BamH\ site. Also present in the vector was another mammalian 
gene at a location not material to potential transcription of a gene inserted at the unique BamH\ site, 
between the viral promoter and terminator sequences. [The mammalian gene comprised an approximately 
2,500 bp mouse dihydrofolate reductase (DHFR) minigene isolated from plasmid pMG— 1 as in Gasser, et 
al., P.N.A.S. (U.SA.L 79, pp. 6522—6526, (1982).] Again, the major operative components of plasmid 
pDSVLI comprise nucleotides 2448 through 4362 of pBR322 along with nucleotides 5171 through 270 (342 
bp) and 2553 through 2770 (237 bp) of SV40 DNA. 

Following procedures described, e.g., in Maniatis, et al„ supra, the EPO-encoding DNA was isolated 
from plasmid pBR — EPO as a BamH\ fragment and ligated into plasmid pDSVLI cut with BamHl Restriction 
enzyme analysis was employed to confirm insertion of the EPO gene in the correct orientation in two of the 
resulting cloned vectors (duplicate vectors H and L). See Figure 2, illustrating plasmid pDSVL— MkE. 
Vectors with EPO genes in the wrong orientation were saved for use as negative controls in transfection 
experiments designed to determine EPO expression levels in hosts transformed with vectors having EPO 
DNA in the correct orientation. 

Vectors H, L, F, X and G were combined with carrier DNA (mouse liver and spleen DNA) were employed 
to transfect duplicate 60 mm plates by calcium phosphate micro precipitate methods. Duplicate 60 mm 
plates were also transfected with carrier DNA as a "mock" transformation negative control. After five days 
all culture media were tested for the presence of polypeptides possessing the immunological properties of 
naturally-occurring EPO. 

Example 7 

A. Initial EPO Expression System Involving COS— 1 Cells 

The system selected for initial attempts at microbial synthesis of isolatable quantities of human EPO 
polypeptide material coded for by the human genomic DNA EPO clone, also involved expression in 
mammalian host cells (i.e., COS-1 cells, A.T.C.C. No. CRL-1650). The human EPO gene was first sub- 
cloned into a shuttle vector which is capable of autonomous replication in both Ecoii hosts (by virtue of 
cv?^ 6 " 0 ! °f pB . R322 derived DNA) and in the mammalian cell line COS-1 (by virtue of the presence of 
SV40 virus derived DNA). The shuttle vector, containing the EPO gene, was then transfected into COS-1 
ceils. EPO polypeptide material was produced in the transfected cells and secreted into the cell culture 
media. 

More specifically, an expression vector was constructed according to the following procedures. DNA 
isolated from lambda clone XhE1, containing the human genomic EPO gene, was digested with BamW\ and 
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Hmd\\\ restriction endonucieases, and a 5.6 Kb DNA fragment known to contain the entire EPO gene was 
isolated. This fragment was mixed and ligated with the bacterial plasmid pUC8 (Bethesda Research 
Laboratories, Inc.) which had been similarly digested, creating the intermediate plasmid "pUC8— HuE", 
providing a convenient source of this restriction fragment 

s The vector chosen for expression of the EPO DNA in COS— 1 cells (pSV4SEt) had previously been 
constructed. Plasmid pSV4SEt contained DNA sequences allowing selection and autonomous replication in 
Eco/L These characteristics are provided by the origin of replication and Ampicillin resistance gene DNA 
sequences present in the region spanning nucleotides 2448 through 4362 of the bacterial plasmid pBR322. 
This sequence was structurally modified by the addition of a linker providing a Hindlll recognition site 

10 immediately adjacent to nucleotide 2448. Plasmid pSV4SEt was also capable of autonomous replication in 
COS — 1 cells. This characteristic was provided by a 342 bp fragment containing the the SV40 virus origin of 
replication (nucleotide numbers 5171 through 270). This fragment has been modified by the addition of a 
linker providing an £o?RI recognition site adjacent to nucleotide 270 and a linker providing a Sail 
recognition site adjacent nucleotide 5171. A 1061 bp fragment of SV40 was also present in this vector 

*5 (nucleotide numbers 1711 through 2772 plus a linker providing a Sa/I recognition site next to nucleotide 
number 2772). Within this fragment was an unique BamH\ recognition sequence. In summary, plasmid 
pSV4SEt contained unique BamH\ and Hind\\\ recognition sites, allowing insertion of the human EPO gene, 
sequences allowing replication and selection in Eco//', and sequences allowing replication in COS — 1 cells. 
In order to insert the EPO gene into pSV4SEt, plasmid pUCS— HuE was digested with BamH\ and Hindlll 

20 restriction endonucieases and the 5.6 kb EPO encoding DNA fragment isolated. pSV4SEt was also digested 
with BamHl and Hind\\\ and the major 2513 bp fragment isolated (preserving all necessary functions). These 
fragments were mixed and ligated, creating the final vector "pSVgHuEPO". (See, Figure 3.) This vector was 
propagated in Ecoli and vector DNA isolated. Restriction enzyme analysis was employed to confirm 
insertion of the EPO gene. 

25 Plasmid pSVgHuEPO DNA was used to express human EPO polypeptide material in COS— 1 cells. 
More specifically, pSVgHuEPO DNA was combined with carrier DNA and transfected into triplicate 60 mm 
plates of COS— 1 cells. As a control, carrier DNA alone was also transfected into COS— 1 cells. Cell culture 
media were sampled five and seven days later and tested for the presence of polypeptides possessing the 
immunological properties of naturally occurring human EPO. 

30 

B. Second EPO Expression System Involving COS — 1 Cells 

Still another system was designed to provide improved production of human EPO polypeptide 
material coded by the human genomic DNA EPO clone in COS— 1 cells (AT.C.C. No. CRL— 1650). 

In the immediately preceding system, EPO was expressed in COS— 1 cells using its own promoter 
35 which is within the 5.6 Kb BamH\ to Hind\\\ restriction fragment. In the following construction, the EPO gene 
is altered to that it is expressed using the SV40 late promoter. 

More specifically, the cloned 5.6 Kb BamHl to Hind\\\ genomic human EPO restriction fragment was 
modified by the following procedures. Plasmid pUC8— HuE, as described above, was cleaved with BamH\ 
and with BstEU restriction endonucieases. BstEll cleaves within the 5.6 Kb EPO gene at a position which is 
40 44 base pairs 5' to the initiating ATG coding for the pre-peptide and approximately 680 base pairs 3' to the 
Hindlll restriction site. The approximately 4900 base pair fragment was isolated. A synthetic linker DNA 
fragment, containing SaA and BstBl sticky ends and an internal BamHl recognition site was synthesized 
and purified. The two fragments were mixed and ligated with plasmid pBR322 which had been cut with Sa/I 
and BamHl to produce the intermediate plasmid pBRgHE. The genomic human EPO gene can be isolated 
4$ therefrom as a 4900 base pair BamHl digestion fragment carrying the complete structural gene with a 
single ATG 44 base pairs 3' to BamHl site adjacent the amino terminal coding region. 

This fragment was isolated and inserted as a BamHl fragment into BamHl cleaved expression vector 
plasmid pDSVU (described in Example 6). The resulting plasmid, pSLVgHuEPO, as illustrated in Figure 4, 
was used to express EPO polypeptide material from COS— 1 cells, as described in Examples 6 and 7A. 

50 

Example 8 

Culture media from growth of the six transfected COS— 1 cultures of Example 6 were analyzed by 
radioimmunoassay according to the procedures set forth in Example 2, Part B. Each sample was assayed at 
250, 125, 50, and 25 microliter aliquot levels. Supernatants from growth of cells mock transfected or 

55 transfected with vectors having incorrect EPO gene orientation were unambiguously negative for EPO 
immunoreactivity. For each sample of the two supernatants derived from growth of COS— 1 cells 
transfected with vectors (H and L) having the EPO DNA in the correct orientation, the % inhibition of 
125 l— EPO binding to antibody ranged from 72 to 88%, which places all values at the top of the standard 
curve. The exact concentration of EPO in the culture supernatant could not then reliably be estimated. A 

eo quite conservative estimate of 300 mU/ml was made, however, from the value calculation of the largest 
aliquot size (250 microliter). 

A representative culture fluid according to Example 6 and five and seven day culture fluids obtained 
according to Example 7A were tested in the RIA in order to compare activity of recombinant monkey and 
human EPO materials to a naturally-occurring human EPO standard and the results are set out in graphic 

65 form in Figure 1. Briefly, the results expectedly revealed that the recombinant monkey EPO significantly 
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competed for anti-human EPO antibody although it was not able to completely inhibit binding under the 
test conditions. The maximum percent inhibition values for recombinant human EPO, however, closely 
approximated those of the human EPO standard. The parallel nature of the dose response curves suggest 
immunological identity of the sequences (epitopes) in common. Prior estimates of monkey EPO in culture 
5 fluids were re-evaluated at these higher dilution levels and were found to range from 2.91 to 3.12 U/ml. 
Estimated human EPO production levels were correspondingly set at 302 mil/ml for the five-day growth 
sample and 567 mil/ml for the seven day growth sample. Estimated monkey EPO production levels in the 
Example 78 expression system were on the same order or better. 

io Example 9 

Culture fluids prepared according to Examples 6 and 7 were subjected to an in vitro assay for EPO 

activity according to the procedure of Goldwasser, et al., Endocrinology, 97, 2, pp. 315 — 323 <1975). 

Estimated monkey EPO values for culture fluids tested ranged from 3.2 to 4.3 U/ml. Human EPO culture 

fluids were also active in this in vitro assay and, further, this activity could be neutralized by anti-EPO 
is antibody. The recombinant monkey EPO culture fluids according to Example 6 were also subjected to an 

assay for in vivo biological activity according to the general procedures of Cotes, et al.. Nature, 191, pp. 

1065—1067 (1961) and Hammond, et al., Ann.N. YAcad.ScL, 149, pp. 516—527 (1968) and activity levels 

ranged from 0.94 to 1.24 U/ml. 

Example 10 

20 In the previous examples, recombinant monkey or human EPO material was produced from vectors 
used to transfect COS— 1 cells. These vectors replicate in COS— 1 cells due to the presence of SV40 T 
antigen within the cell and an SV40 origin of replication on the vectors. Though these vectors produce 
useful quantities of EPO in COS — 1 cells, expression is only transient (7 to 14 days) due to the eventual loss 
of the vector. Additionally, only a small percentage of COS — 1 became productively transfected with the 

25 vectors. The present example describes expression systems employing Chinese hamster ovary (CHO) 
DHFR" cells and the selectable marker, DHFR. [For discussion of related expression systems, see U.S. 
Letters Patent No. 4,399,216 and European Patent Applications 117058, 117059 and 117060, all published 
August 29, 1984.) 

CHO DHFR* cells (DuX— B11) CHD K1 cells, Urlaub, et al., Proc. Nat Acad. ScL (U.S.A.), Vol. 77, 4461 
30 (1980) lack the enzyme dihydrofolate reductase (DHFR) due to mutations in the structural genes and 
therefore require the presence of glycine, hypoxanthine, and thymidine in the culture media. Plasmids 
pDSVL — MkE (Example 6) or pDSVL — gHuEPO (Example 7B) were transfected along with carrier DNA into 
CHO DHFR - cells growing in media containing hypoxanthine, thymidine, and glycine in 60 mm culture 
plates. Plasmid pSVgHuEPO (Example 7A) was mixed with the plasmid pMG2 containing a mouse 
35 dihydrofolate reductase gene cloned into the bacterial plasmid vector pBR322 (per Gasser, et al., supra.) 
The plasmid mixture and carrier DNA was transfected into CHO DHFR" cells. (Cells which acquire one 
plasmid will generally also acquire a second plasmid). After three days, the cells were dispersed by 
trypsinization into several 100 mm culture plates in media lacking hypoxanthine and thymidine. Only those 
cells which have been stably transformed with the DHFR gene, and thereby the EPO gene, survive in this 
40 media. After 7 — 21 days, colonies of surviving cells became apparent These transformant colonies, after 
dispersion by trypsinization can be continuously propagated in media lacking hypoxanthine and 
thymidine, creating new cell strains (e.g., CHO pDSVL — MkEPO, CHO pSVgHuEPO, 
CHO-pDSVL-gHuEPO). 

Culture fluids from the above cell strains were tested in the RIA for the presence of recombinant 
45 monkey or human EPO. Media for strain CHO pDSVL— MkEPO contained EPO with immunological 
properties like that obtained from COS — 1 cells transfected with plasmid pDSVL— MkEPO. A representative 
65 hour culture fluid contained monkey EPO at 0.60 U/ml. 

Culture fluids from CHO pSVgHuEPO and CHO pDSVL— gHuEPO contained recombinant human EPO 
with immunological properties like that obtained with COS — 1 cells transfected with plasmid pSVgHuEPO 
so or pDSVL — gHuEPO. A representative 3 day culture fluid from CHO pSVgHuEPO contained 2.99 U/ml of 
human EPO and a 5.5 day sample from CHO pDSVL— gHuEPO had 18.2 U/ml of human EPO as measured 
by the RIA. 

The quantity of EPO produced by the cell strains described above can be increased by gene 
amplification giving new cell strains of greater productivity. The enzyme dihydrofolate reductase (DHFR) 

55 which is the product coded for by the DHFR gene can be inhibited by the drug methotrexate (MTX). More 
specifically, cells propagated in media lacking hypoxanthine and thymidine are inhibited or killed by MTX. 
Under the appropriate conditions, (e.g., minimal concentrations of MTX) cells resistant to and able to 
grown in MTX can be obtained. These cells are found to be resistant to MTX due to an amplification of the 
number of their DHFR genes, resulting in increased production of DHFR enzyme. The surviving cells can, in 

60 turn, be treated with increasing concentrations of MTX, resulting in cell strains containing greater numbers 
of DHFR genes. "Passenger genes" (e.g., EPO) carried on the expression vector along with the DHFR gene 
or transformed with the DHFR gene are frequently found also to be increased in their gene copy number. 

As examples of practice of this amplification system, cell strain CHO pDSVL— MkE was subjected to 
increasing MTX concentrations (0 nM, 30 nM and 100 nM). Representative 65-hour culture media samples 

b£ from each amplification step were assayed by RIA and determined to contain 0.60, 2.45 and 6.10 U/ml, 
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re A P n eCt ^ C . e ," Strai " CH ° P DSVL -9H"EPO was subjected to a series of increasing MTX concentrations 
°1 3 ?n„ M '^!^ 1 °° nM ' 200 nM ' 1 MM ' and 5 mM MTX - A representative 3-day culture media sample from 
the 100 nM MTX step contained human EPO at 3089 ± 129 u/ml as judged by RIA. Representative 48 hour 
l^l^.l ?'^ m sam P |es from the 100 nM and 1 UM MTX steps contained, respectively, human EPO at 466 
and 1352 U/ml as judged by RIA (average of triplicate assays), in these procedures, 1 x 10 6 cells were plated 
m 5 ml of media in 60 mm culture dishes. Twenty-four hours later the media were removed and replaced 
with 5 ml of serum-free media (high glucose DMEM supplemented with 0.1 mM non-essential amino acids 
am Z 1- ° Wa , S f " 0VV ,? d t0 accumulate for 4 8 hours in the serum-free media. The media was 
m ,/ RIA M assay and the ce,ls were fVPsinized and counted. The average RIA values of 467 U/ml and 
filln MM.L»- r TK 9r0Wn 31 1 n° nM and 1 MM MTX ' res Pectively, provided actual yields of 2335 U/plate and 
6750 u/plate. The average cell numbers per plate were 1.94 x 10 s and 3.12 x 10 s cells, respectively The 
effective production rates for these culture conditions were thus 1264 and 2167 U/10 6 cells/48 hours 
S^Jtll^ CU ' tl T des cribed immediately above are a genetically heterogeneous population. 
Standard screening procedures are being employed in an attempt to isolate genetically hemogeneous 
clones with the highest production capacity. See, Section A, Part 2, of "Points to ConsiderTn the 
Characterization of Cell Lines Used to Produce Biologies", June 1, 1984, Office of Biologies Research 
Rev.ew, Center for Drugs and Biologies, U.S. Food and Drug Administration. "esearcn 

rpii 152,^21!?^ ° f EP ° P roducin 9 CH0 cell lines described above can be improved by appropriate 
cell culure techniques. The propagation of mammalian cells in culture generally requires the presence of 

^c n o^n e s 9 e r ZI h m6d H ia - , A T h ° d f u° r pr ° dUCti0n 0f er Vt hra P0ietin from CHO cells "IffiSSS 
m^thnri H«rrihTK 9 , reatly the P urificati °" <* erythropoietin from the culture medium. The 

SSSi^SS!^ producin9 efythrop ° ietin in «™*« «" ' a *° 

cell SZ C fl«| P «= D ?-hh~ 9 .? UEP0 Ce " S ' 9r0Wn Standard ce " culture cond i«°ns, are used to seed spinner 
ml^™.,^ J \^L are as a suspension cell line in the spinner ceil culture flask in 

^ ™ i -^o ™T e ° f hi9h 9,ucose DMEM and Ham ' 8 F12 supplemented with 5% feta" 

calf serum, L-glutamine, Penicillin and Streptomycin, 0.05 nM non-essential amino acids and the 
appropriate concentration of methotrexate. Suspension cell culture allows the EPO-producing CHO cells to 
ilfaT h* e H S " Y t0 ,a ;9 e c volum 7 eS - CH ° Ce " S ' 9rown in suspension, are used to seed rolls? bottles at an 
LTIh t«'^ T ,tY °l 15 * 10 Viable ce " s per 850 cm * rol,er bottle 200 ml of ™dia. The cells are 
nh°,« nf to confluency as an adherent cell line over a three-day period. The media used for this 

1 9 ' S the same as used for growth in suspension. At the end of the three-day growth 
«fh- T"? COnta ^'. n I ?.. r!,ed,a is removed and re P |aced with 100 ml <* serum-free media; 60-50 
ri^m high glucose DMEM and Ham's F12 supplemented with 0.05 mM non-essential amino acidslnd 
L-glutam.ne. The roller bottles are returned to the roller bottle incubator for a period of 1—3 hours and the 

th! Jfrfi 8 ?,!! re "!? Ved !. nd rep ! aced with 100 ml of fresh serum-free media. The 1-3 hour incubation of 
!2f. J? - J 3 red , UCeS the conc *« r "«««' of contaminating serum proteins. The roller bottles are 
* '"J 5 "* .l t0r f ° r Se ?" days during whlch erythropoietin accumulates in the serum-free culture 
media. At the end of the seven-day production phase, the conditioned media is removed and replaced with 
tresn serum-free medium for a second production cycle. As an example of the practice of this production 

ZawJ£u^T% M 2!?^-" njm4n ? media sam P' e contained human erythropoietin at 3892 ± 
m ^ J f Judged by t ! ,e RIA - Base d on an estimated cell density of 0.9 to 1.8 x 10 s cells/cm 2 , each 850 cm 2 
ro ler bottle contained from 0.75 to 1.5 x 10 8 cells and thus the rate of production of EPO in the 7-dav 100 
ml culture was 750 to 1470 H/10 6 cells/48 hours. *' 

Culture fluids from cell strain CHO pDSVL-MkEPO carried in 10 nM MTX were subjected to RIA in vitro 
and in vivo EPO activity assays. The conditioned media sample container 41.2 ± 1.4 U/ml of MkEPO as 
measured by the RIA, 41 Jl ± 0.064 U/ml as measured by the in vitro biological activity assay and 42 5 + 5 U/ 

Till^^T V 6 TJS? bi °i° 9ical ac,ivitY assay - Arn ' no ac i d sequencing of polypeptide products 
revealed the presence of EPO products, a principle species having 3 residues of the "leader" sequence 
adjacent he putative amino terminal alanine. Whether this is the result of incorrect membrane processing 
of the polypeptide in CHO cells or reflects a difference in structure of the amino terminus of monkey EPO 
vts-a-vis human EPO, is presently unknown. 

Culture fluids from cell strain CHO pDSVL-gHuEPO were subjected to the three assays. A 5.5 day 
sample contained recombinant human EPO in the media at a level of 18.2 U/ml by RIA assay, 15.8 + 4 6 U/ 
ml by in vitro assay and 16.8 + 3.0 U/ml by in vivo assay. ~ ' 

...hi^H? ^'V mm CH ° PDSVL-gHuEPO cells prepared amplified by stepwise 100 nM MTX were 
subjected to the three assays. A 3.0 day sample contained recombinant human EPO at a level of 3089 ± 1 29 
U/ml by RIA, 2589 ± 71.5 U/ml by in vitro assay, and 2040 ± 160 U/ml by in vivo assay. Amino acid 
sequencing of this product reveals an amino terminal corresponding to that designated in Table VI. 

Cell conditioned media from CHO cells transfected with plasmid pDSVL-MkE in 10 nM MTX were 
P ^L5A a "^w, , d,alvzed out ove r several days, resulting in media with an EPO activity of 221 ± 5 1 U/ 
ml (EPO-CCM). To determine the in vivo effect of the EPO-CCM upon hematocrit levels in normal Balb/C 
JI5S 3 ' the f0 "o w| ng experiment was conducted. Cell conditioned media from untransfected CHO cells 
CCM) and EPC--CCM were adjusted with PBS. CCM was used forthe control group (3 mice) and two dose 
levels of EPO-CCM — 4 units per injection and 44 units per injection — were employed for the 
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experimental groups (2 mice/group). Over the course of 5 weeks, the seven mice were injected 
intraperitoneal^, 3 times per week, After the eighth injection, average hematocrit values for the control 
group were determined to be 50.4%; for the 4U group, 55.1%; and, for the 44U group, 67.9%. 

Mammalian cell expression products may be readily recovered in substantially purified form from 
culture media using HPLC (C 4 ) employing an ethanol gradient, preferably at pH7. 

A preliminary attempt was made to characterize recombinant glycoprotein products from conditioned 
medium of COS— 1 and CHO cell expression of the human EPO gene in comparison to human urinary EPO 
isolates using both Western blot analysis and SDS — PAGE. These studies indicated that the CHO-produced 
EPO material had a somewhat higher molecular weight than the COS— 1 expression product which, in turn, 
was slightly larger than the pooled source human urinary extract. All products were somewhat 
heterogeneous. Neuraminidase enzyme treatment to remove sialic acid resulted in COS — 1 and CHO 
recombinent products of approximately equal molecular weight which were both nonetheless larger than 
the resulting asialo human urinary extract. Endoglycosidase F enzyme (EC 3.2.1) treatment of the 
recombinant CHO product and the urinary extract product (to totally remove carbohydrate from both) 
resulted in substantially homogeneous products having essentially identical molecular weight 
characteristics. 

Purified human urinary EPO and a recombinant, CHO cell-produced, EPO according to the invention 
were subjected to carbohydrate analysis according to the procedure of Ledeen, et al. Methods in 
Enzymology, 83(PartD), 139—191 (1982) as modified through use of the hydrolysis procedures of Nesser, 
et al., AnaLBiochem., 142, 58—67 (1984). Experimentally determined carbohydrate constitution values 
(expressed as molar ratios of carbohydrate in the product) for the urinary isolate were as follows: Hexoses, 
1.73; N-acetylglucosamine, 1; N-acetylneuraminic acid, 0.93; Fucose, 0; and N -acetylgalactosamine, 0. 
Corresponding values for the recombinant product (derived from CHO pDSVL— gHuEPO 3-day culture 
media at 100 nM MTX) were as follows; Hexoses, 15.09; N-acetylglucosamine, 1; N-acetylneuraminic acid, 
0.998; Fucose, 0; and N-acetylgalactosamine, 0. These findings are consistent with the Western blot and 
SDS — PAGE analysis described above. 

Glycoprotein products provided by the present invention are thus comprehensive of products having a 
primary structural conformation sufficiently duplicative of that of a naturally-occurring erythropoietin to 
allow possession of one or more of the biological properties thereof and having an average carbohydrate 
composition which differs from that of naturally-occurring erythropoietin. 

Example 11 

The present example relates to the total manufacture by assembly of nucleotide bases of two structural 
genes encoding the human species EPO sequence of Table VI and incorporating, respectively "preferred" 
codons for expression in E.coii and yeast (S.cerevisiae) cells. Also described is the construction of genes 
encoding analogs of human EPO. Briefly stated, the protocol employed was generally as set out in the 
previously noted disclosure of Alton, et al. (WO 83/04053). The genes were designed for initial assembly of 
component oligonucleotides into multiple duplexes which, in turn, were assembled into three discrete 
sections. These sections were designed for ready amplification and, upon removal from the amplification 
system, could be assembled sequentially or through a multiple fragment ligation in a suitable expression 
vector. 

Tables VIII through XIV below illustrate the design and assembly of a manufactured gene encoding a 
human EPO translation product lacking any leader or presequence but including an initial methionine 
residue at position —1. Moreover, the gene incorporated in substantial part E.coii preference codons and 
the construction was therefore referred to as the "ECEPO" gene. 
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TABLE VIII 
ECSPO SECTION 1 OLIGONUCLEOTIDES 



1. 


AATTCTAGAAACCATGAGGGTAATAAAATA 


2. 


CCATTATTTTATTACCCTCATGGTTTCTAG 


3. 


ATGGCTCCGCCGCGTCTGATCTGCGAC 


A. 


CTCGAGTCGCAGATCAGACGCGGCGGAG 


5. 


TCGAGAGTTCTGGAACGTTACCTGCTG 


6. 


CTTcrAnranrcTfl ApRTTrr ap a at t 

w i i wWnuUnuu 1 MHLu t t LUnunML 1 


7. 


GAAGCTAAAGAAGCTGAAAACATC 


8. 


GTGGTGATGTTTTCAGCTTCTTTAG 


9. 


ACCACTGGTTGTGCTGAACACTGTTC 


10. 


CAAAGAACAGTGTTCAGCACAACCA 


11. 


TTTGAACGAAAACATTACGGTACCG 


12. 


GATCCGGTACCGTAATGTTTTCGTT 



TABLE IX 



EC EPO SECTION 1 

Xba l 

EcoRI 1 3 

AATTCTAG AAACCATGAG GGTAATAAAA TaIaTGGCTCC GCCGCGTCTG 
GATC TTTGGTACTC CCATTATTTT ATTACTjGAGG CGGCGCAGAC 

2 1 4 

atctgcgac It cgaga gTtct ggaacgttac ctgctgIgaag ctaaagaagc 
tagacgctga gctcjtcaaga ccttgcaatg gacgaccttcj gatttcttcg 



■ £ .11 

WCCACT GGTT GTGCTGAACA CTGTTC jTTTG AACGAAAACA 
TGGTGpCCAA CACGACTTGT GACAAGATO] TTGCTTTTGT 



_ . _ .11 

TGAAAACATC 

ACTTTTGTAG 

8 ' 10 



Kon I Bam HI 
TTACGGTACC G 
AATGCCATGG CCTAG 
12 
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TABLE X 
SECTION 2 OLIGONUCLEOTIDES 


1. 


AAT TP nnTflPPflp.Qpflpp a a ppt 


2. 


GTTfl flPPTTCPTPTPTPPT Ar*r*r» 


3. 


TAAPTTPTflPPPTTP.Pflfl zippta t 


4 . 


TTPPflT flPCTTTPP AArrrTAnA « 
i 1 1 LLAAGuGTAGAA 


5 . 


««nnUI 1 bb 1 LAAu AAGC AGTTGA AGT 


O • 


LCAAACTTCAACTGCTTGTTGACCAAC 


7 


TTGGCAGGGTCTGGCACTGCTGAGCG 


Q 

O • 


GCCTCGCTCAGCAGTGCCAGACCCTG 


Q 


AGGCTGTACTGCGTGGCCAGGCA 




GCAGTGCCTGGCCACGCAGTACA 


li * 


CTGCTGGTAAACTCCTCTCAGCCGT 




TTCCCACGGCTGAGAGGAGTTTACCA 


13. 


GGGAACCGCTGCAGCTGCATGTTGAC 


14. 


GCTTTGTCAACATGCAGCTGCAGCGG 


15. 


AAAGCAGTATCTGGCCTGAGATCTG 


16. 


GATCCAGATCTCAGGCCAGATACT 



TABLE XI 
ECEPO SECTION 2 
EcoRI Kgnl 1 3 

A ATTCGGTACC AGACACCAAG GTlTAACTTCT ACGCTTGGAA ACGTATbcAA 
GCCATGG TCTGTGGTTC CAATTGj^AGA TGCGAACCTT TGCATACCTfJ 

GTTGGTCAAC AAGCAGTTGA AGT jrTGGC AC GGTCTGGCAC TGCTGAGCGfe 
CAACCAGTT-G TTCGTCAACT TCAHCCjSTC CCAGACCGTG ACGACTCGCT 

- 1 

GGCTCTACTG CGTGGCCAGG CACTGCTGGT AAACTCCTCT CAGCCGTGGG 

ccgHcatgac gcaccggtcc gtgaTgJacca TTTGAGGAGA GTCGGCAETc" 

10 » 12 

aaccgctgTa gctgcatgtt cac Ihaacca g tatcMgcct GAcfrffc ~ HI 

TTjGGCGACGT CGACGTACAA CTGTffCGlrc ATAGACCGGA CTCTAGACCTAC 
ii 16 
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TABLE XII 
ECEPO SECTION 3 

1. GATCCAGATCTCTGACTACTCTGC 

2. ACGCAGCAGAGTAGTCAGAGATCTG 

3. TGCGTGCTCTGGGTGCACAGAAAGAGG 

4. GATAGCCTCTTTCTGTGCACCCAGAGC 

5. CTATCTCTCCGCCGGATGCTGCATCT 

6. CAGCAGATGCAGCATCCGGCGGAGA 

7. GCTGCACCGCTGCGTACCATCACTG 

8. ATCAGCAGTGATGGTACGCAGCGGTG 

9. CTGATACCTTCCGCAAACTGTTTCG 

10 . ATACACGAAACAGTTTGCGGAAGGT 

11 . TGTATACTCTAACTTCCTGCGTGGTA 

12 . CAGTTT ACCACGC AGGA AGTTAGAGT 

13 . AACTGAAACTGTATACTGGCGAAGC 

14. GGCATGCTTCGCCAGTATACAGTTT 

15 . ATGCCGTACTGGTGACCGCTAATAG 

16 . TCGACTATTAGCGGTCACCAGTAC 
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TABLE XIII 
ECEPO SECTION 3 



BamHI BglH 
GA TCCAGATCTCTG 
GTCTAGAGAC 



I L I , 5 

ACTACTCTGC [TGCGT GCTCT GGGTGCACAG AAAGAGG CTA TC TCTCCGCC 

TGATGAGACG AC GC AC GAGA CCCACGTGTC TTTCTCCGAT AGJAGAGGCGG 
2 A 1 

, 1 , £ 

GGATGCTGCA TCTCCTGCAC CGCTGCGTAC CATCACT GCT GAT ACCTTCC 

CCTACGACGT AGACtfACfcTG GCGACGCATG GTAGTGACGA CTflTGGAAGG 
6 r 8 



11 



13 



GCAAACTGTT TCGjTGTATAC TCTAACTTCC TGCGTGGTAl A ACTGA AACTG 
CGTTTGACAA AGCACATAfTG AGATTGAAGG ACGCACCATT TGACfTTTGAC 
10 12 

15 Sai l 
TATACTGGCG AAGC frTGCC G TACTGGTGAC CGCTAATAG 
ATATGACCGC TTCGT ACGCfc ATGACCACTG GCGATTATC AGCT 
14 ^16 
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45 



50 



55 



60 
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TABLE XIV 
ECEPO GENE 

-1 1 

Xbal MetAla 
CTAG AAACCATGAG GGTAATAAAA TAATGGCTCC GCCGCGTCTG 
TTTGGTACTC CCATTATTTT ATTACCGAGG CGGCGCAGAC 

ATCTGCGACT CGAGAGTTCT GGAACGTTAC CTGCTGGAAG CTAAAGAAGC 
TAGACGCTGA GCTCTCAAGA CCTTGCAATG GACGACCTTC GATTTCTTCG 

TGAAAACATC ACCACTGGTT GTGCTGAACA CTGTTCTTTG AACGAAAACA 
ACTTTTGTAG TGGTGACCAA CACGACTTGT GACAAGAAAC TTGCTTTTGT 



20 



TTACGGTACC AGACACCAAG GTTAACTTCT ACGCTTGGAA ACGTATGGAA 
AATGCCATGG TCTGTGGTTC CAATTGAAGA TGCGAACCTT TGCATACCTT 



25 



GTTGGTCAAC AAGCAGTTGA AGTTTGGCAG GGTCTGGCAC TGCTGAGCGA 
CAACCAGTTG TTCGTCAACT TCAAACCGTC CCAGACCGTG ACGACTCGCT 



30 



GGCTGTACTG CGTGGCCAGG CACTGCTGGT AAACTCCTCT CAGCCGTGGG 
CCGACATGAC GCACCGGTCC GTGACGACCA TTTGAGGAGA GTCGGCACCC 



35 



40- 



AACCGCTGCA GCTGCATGTT GACAAAGCAG TATCTGGCCT GAGATCTCTG 
TTGGCGACGT CGACGTACAA CTGTTTCGTC ATAGACCGGA CTCTAGAGAC 

ACTACTCTGC TGCGTGCTCT GGGTGCACAG AAAGAGGCTA TCTCTCCGCC 
TGATGAGACG ACGCACGAGA CCCACGTGTC TTTCTCCGAT AGAGAGGCGG 

GGATGCTGCA TCTGCTGCAC CGCTGCGTAC CATCACTGCT GATACCTTCC 
CCTACGACGT AGACGACGTG GCGACGCATG GTAGTGACGA CTATGGAAGG 



45 



50 



GCAAACTGTT TCGTGTATAC TCTAACTTCC TGCGTGGTAA ACTGAAACTG 
CGTTTGACAA AGCACATATG AGATTGAAGG ACGCACCATT TGACTTTGAC 

Sail 

TALACTGGCG AAGCATGCCG TACTGGTGAC CGCTAATAG 

ATATGACCGC TTCGTACGGC ATGACCACTG GCGATTATCA GCT 



55 



60 



65 



JJ° re particularly. Table VIII illustrates oligonucleotides employed to generate the Section 1 of the 
ECEPO gene encoding amino terminal residues of the human species polypeptide. Oligonucleotides were 
assembled into duplexes (/ and 2, 3 and 4, etc.) and the duplexes were then ligated to provide ECEPO 
Section 1 as in Table IX. Note that the assembled section includes respective terminal fcoRI and BamW\ 
sticky ends, that "downstream" of the £coRI sticky end is a Xba\ restriction enzyme recognition site; and 
that upstream of the BamH\ sticky end is a Kpn\ recognition site. Section 1 could readily be amplified 
using the M13 phage vector employed for verification of sequence of the section. Some difficulties were 
encountered in isolating the section as an Xba\IKpn\ fragment from RF DNA generated in Ecotf, likely due 
to methylation of the Kpn\ recognition site bases within the host. Single-stranded phage DNA was therefore 
isolated and rendered into double-stranded form in vitro by primer extension and the desired double- 
stranded fragment was thereafter readily isolated. 
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ECEPO gene Sections 2 and 3 (Tables XI and XIII) were constructed in a similar manner from the 
oligonucleotides of Tables X and XII, respectively. Each section was amplified in the Ml 3 vector employed 
for sequence verification and was isolated from phage DNA. As is apparent from Table XI, ECEPO Section 2 
was constructed with EcoHl and BamWl sticky ends and could be isolated as a KpnMBglW fragment. 
Similarly, ECEPO Section 3 was prepared with BamH\ and Sal\ sticky ends and could be isolated from 
phage RF DNA as a BglWISail fragment. The three sections thus prepared can readily be assembled into a 
continuous DNA sequence (Table XIV) encoding the entire human species EPO polypeptide with an amino 
terminal methionine codon (ATG) for E.coli translation initiation. Note also that "upstream" of the initial 
ATG is a series of base pairs substantially duplicating the ribosome binding site sequence of the highly 
expressed OMP— f gene of E.coli. 

Any suitable expression vector may be employed to carry the ECEPO. The particular vector chosen for 
expression of the ECEPO gene as the "temperature sensitive" plasmid pCFM536 — a derivative of plasmid 
pCFM414 (A.T.C.C. 40076) — as described in co-pending U.S. Patent Application Serial No. 636,727, filed 
August 6, 1984, by Charles F. Morris. More specifically, pCFM536 was digested with Xba\ and HindlW; the 
large fragment was isolated and employed in a two-part ligation with the ECEPO gene. Sections 1 (Xball 
Kpn\), 2 (KpnMBglW) and 3 (Bgl\\/Saf\) had previously been assembled in the correct order in M13 and the 
EPO gene was isolated therefrom as a single Xba\IHind\\\ fragment. This fragment included a portion of the 
polylinker from M13 rnp9 phage spanning the Sail to HindlW sites therein. Control of expression in the 
resulting expression plasmid, p536, was by means of a lambda P L promoter, which itself may be under 
control of the C I8S7 repressor gene (such as provided in E.coli strain K12AHtrp). 

The manufactured ECEPO gene above may be variously modified to encode erythropoietin analogs 
such as (Asn 2 , des-Pro 2 through lle 6 ]hEPO and [His 7 ]hEPO, as described below. 

A. [Asn 2 , des-Pro 2 through lle 6 )hEPO 

Plasmid 536 carrying the ECEPO manufactured gene of Table XIV as a Xba\ to Hind\\\ insert was 
digested with HindlW and Xhol. The latter endonuclease cuts the ECEPO gene at a unique, 6 base pair 
recognition site spanning the last base of the codon encoding Asp 8 through the second base of the Arg 10 
codon. A Xba\IXho\ "linker" sequence was manufactured having the following sequence: 

Xbal +1 2 7 8 9 

Met Ala Asn Cys Asp Xho l 

5 -CTAG ATG GCT AAT TGC GAC-3* 

3'-TAC CGA TTA ACG CTG AGCT-5* 

The Xba\IXho\ linker and the Xho\IHind\\\ ECEPO gene sequence fragment were inserted into the large 
fragment resulting from Xba\ and Hind\\\ digestion of plasmid pCFM526 — a derivative of plasmid 
pCFM414 (A.T.C.C. 40076) — as described in co-pending U.S. Patent Application Serial No. 636,727, filed 
August 6, 1984, by Charles F. Morris, to generate a plasmid-borne DNA sequence encoding E.coli 
expression of the Met" 1 form of the desired analog. 

B. [His 7 JhEPO 

Plasmid 536 was digested with HindlW and Xhol as in part A above. A XballXhol linker was 
manufactured having the following sequence: 

Xbal +1 23456789 Xhol 

Met Ala Pro Pro Arg Leu lie His Asp 
5 -CTAG ATG GCT CCG CCA CGT CTG ATC CAT GAC-3'' 

3 -TAC CGA GGC GGT GCA GAC TAG GTA CTG AGCT-5* 

The linker and the XhollHindlW ECEPO sequence fragment were then inserted into pCFM526 to 
generate a plasmid-borne DNA sequence encoding E.coii expression of the Met" 1 form of the desired 
analog. 

Construction of a manufactured gene ("SCEPO") incorporating yeast preference codons is as 
described in the following Tables XV through XXI. As was the case with the ECEPO gene, the entire 
construction involved formation of three sets of oligonucleotides (Tables XV, XVII and XIX) which were 
formed into duplexes and assembled into sections (Tables XVI, XVIII and XX). Note that synthesis was 
facilitated in part by use of some sub-optimal codons in both the SCEPO and ECEPO constructions, i.e., 
oligonucleotides 7—12 of Section 1 of both genes were identical, as were oligonucleotides 1—6 of Section 
2 in each gene. 
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TABLE XV 


SCEPO 


SECTION 1 0L IGONUCLEOTIDF^ 


X • 


AATTCAAGCTTGGATAAAAGAGCT 


£ » 


bl UbAGC ICT T TTATCCAAGCTTG 




CCACCAAGATTGATCTGTGACTC 


h 

H • 


TCTCGAGTCACAGATCAATCTTG 


c 


OA^A^TTT^^#HA A A n a ^ * v w & v m 

GAGAGTTTTGGAAAGATACTTGTTG 


£ 

O • 


^ T ^ A AnAAA^AV M ^ ^ *m M a * m m. a 

CTTCCAACAAGTATCTTTCCAAAAC 


/ • 


GAAGCTAAAGAAGCTGAAAACATC 


o 

0 . 


GTGGTGATGTTTTCAGCTTCTTTAG 


9. 


ACCACTGGTTGTGCTGAACACTGTTC 


10. 


CAAAGAACAGTGTTCAGCACAACCA 


11. 


TTTGAACGAAAACATTACGGTACCG 


12, 


GATCCGGTACCGTAATGTTTTCGTT 



TABLE XVI 
SCEPO SECTION 1 



EcoRI riindlll 1 
AATTCA AGCTTGGATA 
G T TCGAACCTAT 
2 



AAAGAGCT^C^ACCAAGATTG ATCTGTGACT cIjAGAGTTTT 
TTTCTCGAGG TGjGTTCTAAC TAGACACTGA GCTiClfcAAAA 

£ 

5 7 

GGAAAGATAC TTGTTG pAAG CTAAAGAAGC TGAAAACATC hCCACTGGTT 
CCTTTCTATG AACAACCTTc] GATTTCTTCG ACTTTTGTAG tTJgTgJACCAA 

6 8 • 

^o«t«-I-„- . ii Kpn l BamHI 

GTGCTGAACA CTGTTC |TTTG AACGAAAACA TTACGGTACC G 

CACGACTTGT GACAAGAAACl TTGCTTTTGT AATGCCATGG CCTAG 

• 12 
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TABLE XVII 



SCEPO 


SECTION 2 OLIGONUCLEOTIDES 


1. 


AATTCGGTACCAGACACCAAGGT 


2. 


GTTAACCTTGGTGTCTGGTACCG 


3. 


TAACTTCTACGCTTGGAAACGTAT . 


4. 


TTCCATACGTTTCCAAGCGTAGAA 


5. 


GGAAGTTGGTCAACAAGCAGTTGAAGT 


6. 


CCAAACTTCAACTGCTTGTTGACCAAC 


7. 


TTGGCAAGGTTTGGCCTTGTTATCTG 


8. 


GCTTCAGATAACAAGGCCAAACCTTG 


9. 


AAGCTGTTTTGAGAGGTCAAGCCT 


10. 


AACAAGGCTTGACCTCTCAAAACA 


11. 


TGTTGGTTAACTCTTCTCAACCATGGG 


12. 


TGGTTCCCATGGTTGAGAAGAGTTAACC 


13. 


AACCATTGCAATTGCACGTCGAT 


14. 


CTTTATCGACGTGCAAT-TGCAA 


15. 


AAAGCCGTCTCTGGTTTGAGATCTG 


16. 


GATCCAGATCTCAAACCAGAGACGG 
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TABLE XVIII 
SCEPO SECTION 2 



Kpn l 
EcoRI 1 

A ATTCGGTACC AGACACCAAG 
GCCATGG TCTGTGGTTC 
2 



, 2 .5 

GTITAACTTCT ACGCTTGGAA ACGTAT EGAA GTTGGTCAAC AAGCTGTTGA 

CAATTGAAGA TGCGAACCTT TGCATACCTfj CAACCAGTTG TTCGACAACT 

AGTjTTGGCAA GGTTTGGCCT TGTTATCTG p AGCT GTTTTG AGAGGTCAAG 

TCAAACCpTT CCAAACCGGA ACAATAGACT TCgIaCAAAAC TCTCCAGTTC 

8 10 

CCTiTGTTCGT TAACTCTTCT CAACCATGGG MCCAT TGCA ATTGCACGTC 

GGARCAACCA ATTGAGAAGA GTTGGTACCC TTGGTbACGT TAACGTGCAG 

* 12 ii 

15 BolII BamHI 

GAT RAAGC CG TCTCTGGTTT GAGATCTG 

CTATTfcfcGC AGAGACCAAA CTCTAGACCTA G 

16 
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TABLE XIX 


scepo 


SECTION 3 OLIGONUCLEOTIDES 


l • 


^AT/*f*A/*ATf*TTT/*A^*T A/^VTT^TT 

GATCCAGATCTTTGACTACTTTGTT 


2. 


TCTCAACAAAGTAGTCAAAGATCTG 


3. 


GAGAGCTTTGGGTGCTCAAAAGGAAG 


* 


ATGGCTTCCTTTTGAGCACCCAAAGC 


5. 


CCATTTCCCCACCAGACGCTGCTT 


6. 


GCAGAAGCAGCGTCTGGTGGGGAA 


7. 


A% ^A A At* 4** A^ A*« 4Va A^ A 4V ^A A^ A AV A A ^A A A ^A A^ 

CTGCCGCTCCATTGAGAACCATC 


8. 


CAGTGATGGTTCTCAATGGAGCG 


9. 


ACTGCTGATACCTTCAGAAAGTT 


10. 


GAATAACTTTCTGAAGGTATCAG 


11. 


ATTCAGAGTTTACTCCAACTTCT 


12. 


CTCAAGAAGTTGGAGTAAACTCT 


13. 


^A a*a. A A A Mb Afe *A^ AAA ^B> ^Bk A A ^Bh ^Bfe. *Bt A ^Bfe, A *A*l 

TGAGAGGTAAATTGAAGTTGTACAC 


14. 


ACCGGTGTACAACTTCAATTTACCT 


15 . 


CGGTGAAGCCTGTAGAACTGGT 


16. 


A% *»W 4h ^A Jh A dB* ^Bk JB AfB» A^B) ^BB mm A ^Ah ^Ak Jh «■* ^Bfe *^A. 

CTGTCACCAGTTCTACAGGCTTC 


17. 


GACAGATAAGCCCGACTGATAA 


18. 


GTTGTTATCAGTCGGGCTTAT 


19. 


CAACAGTGTAGATGTAACAAAG 


20. 


TCGACTTTGTTACATCTACACT 
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TABLE XX 



SCEPO SECTION 3 



BamHI Bqlll 1 

GATC CAGATCTTTG ACTACTTTGT T ^AGAG CTTT 
GTCTAGAAAC TGATGAAACA ACTUT^GAAA 



2 5 

GGGTGCTCAA AAGGAAG fcCA TT TCCCCACC AGACGCTGCT TCTGCCGCTC 

CCCACGAGTT TTCCTTCGGT AjftAGGGGTGG TCTGCGACGA AGACGGCGAG 
£ 6 

1 . 9 11 

CATTGAGAAC CATC fiCTGC T GATACCTTCA GAAAGTT ^TT CA GAGTTTAC 
GTAACTCTTG GTAGTGACfcA CTATGGAAGT CTTTCAATAA — G|rCTCAAATG 
£ 10 12 



TCCAACTTCT jTGAG ACGTAA ATTGAAGTTG TACACfcGGTG AAGCCTGTAG 
AGGTTGAAGA ECTC]TCCATT TAACTTCAAC ATGTGGCTa]C TTCGGACATC 




Sail 



ATGTAACAAA G 
TACATTGTTT CAGCT 
20 
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TABLE XXI 
SCEPO GENE 

-1 +1 
Hindlll ArgAla 

AGCTTGGATA AAAGAGCTCC ACCAAGATTG ATCTGTGACT CGAGAGTTTT 
ACCTAT TTTCTCGAGG TGGTTCTAAC TAGACACTGA GCTCTCAAAA 

GGAAAGATAC TTGTTGGAAG CTAAAGAAGC TGAAAACATC ACCACTGGTT 
CCTTTCTATG AACAACCTTC GATTTCTTCG ACTTTTGTAG TGGTGACCAA 

GTGCTGAACA CTGTTCTTTG AACGAAAACA TTACGGTACC AGACACCAAG 
CACGACTTGT GACAAGAAAC TTGCTTTTGT AATGCCATGG TCTGTGGTTC 

GTTAACTTCT ACGCTTGGAA ACGTATGGAA GTTGGTCAAC AAGCTGTTGA 
CAATTGAAGA TGCGAACCTT TGCATACCTT CAACCAGTTG TTCGACAACT 

AGTTTGGCAA GGTTTGGCCT TGTTATCTGA AGCTGTTTTG AGAGGTCAAG 
TCAAACCGTT CCAAACCGGA ACAATAGACT TCGACAAAAC TCTCCAGTTC 

CCTTGTTGGT TAACTCTTCT CAACCATGGG AACCATTGCA ATTGCACGTC 
GGAACAACCA ATTGAGAAGA GTTGGTACCC TTGGTAACGT TAACGTGCAG 

GATAAAGCCG TCTCTGGTTT GAGATCTTTG ACTACTTTGT TGAGAGCTT T 
CTATTTCGGC AGAGACCAAA CTCTAGAAAC TGATGAAACA ACTCTCGAAA 

GGGTGCTCAA AAGGAAGCCA TTTCCCCACC AGACGCTGCT TCTGCCGCTC 
CCCACGAGTT TTCCTTCGGT AAAGGGGTGG TCTGCGACGA AGACGGCGAG 

CATTGAGAAC CATCACTGCT GATACCTTCA GAAAGTTATT CAGAGTTTAC 
GTAACTCTTG GTAGTGACGA CTATGGAAGT CTTTCAATAA GTCTCAAATG 

TCCAACTTCT TGAGAGGTAA ATTGAAGTTG TACACCGGTG AAGCCTGTAG 
AGGTTGAAGA ACTCTCCATT TAACTTCAAC ATGTGGCCAC TTCGGACATC 

AACTGGTGAC AGATAAGCCC GACTGATAAC AACAGTGTAG 
TTGACCACTG TCTATTCGGG CTGACTATTG TTGTCACATC 

Sai l 

ATGTAACAAA G 
TACATTGTTT CAGCT 



The assembled SCEPO sections were sequenced in M13 and Sections 1. 2 and 3 were isolatable from 
the phage as Hind\UIKpn\IBGI\\, and BgMISall fragments. 

The presently preferred expression system for SCEPO gene products is a secretion system based on 
S.cerevisiae o-factor secretion, as described in co-pending U.S. Patent Application Serial No. 487,753, filed 
April 22, 1983, by Grant A. Bitter, published October 31, 1984 as European Patent Application 0 123 294. 
Briefly put, the system involves constructions wherein DNA encoding the leader sequence of the yeast a- 
factor gene product is positioned immediately 5' to the coding region of the exogenous gene to be 
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expressed. As a result the gene product translated includes a leader or signal sequence which is 
"processed off" by an endogenous yeast enzyme in the course of secretion of the remainder of the product. 
Because the construction makes use of the a-factor translation initiation (ATG) codon, there was no need to 
provide such a codon at the -1 position of the SCEPO gene. As may be noted from Table XXI, the alanine 

5 (+1) encoding sequence is preceded by a linker sequence allowing for direct insertion into a plasmid 
including the DNA for the first 80 residues of the a-factor leader following the a-factor promoter. The 
specific preferred construction for SCEPO gene expression involved a four-part ligation including the 
above-noted SCEPO section fragments and the large fragment of HindUU Sa/I digestion of plasmid paC3. 
From the resulting plasmid paC3/SCEPO, the a-factor promoter and leader sequence and SCEPO gene were 

10 isolated by digestion with BamH\ and ligated into BamH\ digested plasmid pYE to form expression plasmid 
DYE/SCEPO. 

Example 12 

The present example relates to expression of recombinant products of the manufactured ECEPO and 
SCEPO genes within the expression systems of Example 11. 

is In use of the expression system designed for use of E.cofi host cells, plasmid p536 of Example 1 1 was 
transformed into AM7 E.cofi cells previously transformed with a suitable plasmid, pMW1, harboring a C, 8S7 
gene. Cultures of cells in LB broth (Ampicillin 50 pg/ml and kanamycin 5 pg/ml, preferably with 10 mM 
MgS0 4 ) were maintained at 28°C and upon growth of cells in culture to O.D. €0o = 0.1, EPO expression was 
induced by raising the culture temperature to 42°C. Cells grown to about 40 O.D. provided EPO production 

20 (as estimated by gel) of about 5 mg/OD liter. 

Cells were harvested, lysed, broken with French Press (10,000 psi) and treated with lysozyme and 
NP— 40 detergent The pellet resulting from 24,000 xg centrifugation was solubilized with guanidine HCI 
and subjected to further purification in a single step by means of C 4 (Vydac) Reverse Phase HPLC (EtOH, 
0—80%, 50 mM NH 4 Ac, pH 4.5). Protein sequencing revealed the product to be greater than 95% pure and 

25 the products obtained revealed two different amino terminals A— P— P— R . . . and P— P— R ... in a relative 
quantitative ratio of about 3 to 1. This latter observation of hEPO and [des Ala 1 ]hEPO products indicates 
that amino terminal "processing" within the host cells serves to remove the terminal methionine and in 
some instances the initial alanine. Radioimmunoassay activity for the isolates was at a level of 150,000 to 
160,000 U/mg; in vitro assay activity was at a level of 30,000 to 62,000 U/mg; and in vivo assay activity 

30 ranged from about 120 to 720 U/mg. (Cf. r human urinary isolate standard of 70,000 U/mg in each assay.) 
The dose response curve for the recombinant product in the in vivo assay differed markedly from that of 
the buman urinary EPO standard. 

The EPO analog plasmids formed in parts A and B of Example 1 1 were each transformed into pMW1- 
transformed AM7 E.coli cells and the cells were cultured as above. Purified isolates were tested in both RIA 

35 and in vitro assays. RIA and in vitro assay values for [Asn 2 , des-Pro 2 through He 6 ]hEP0 expression products 
were approximately 11,000 U/mg and 6,000 U/mg protein, respectively, while the assay values for 
[His 7 ]hEPO were about 41,000 U/mg and 14,000 U/mg protein, respectively, indicating that the analog 
products were from one-fourth to one-tenth as "active" as the "parent" expression product in the assays. 
In the expression system designed for use of S.cerev/siae host cells, plasmid pYE/SCEPO was trans- 

40 formed into two different strains, YSDP4 (genotype a pep4>—3 trp 1) and RK81 (genotype aa pep4— 3 trp /). 
Transformed YSDP4 hosts were grown in SD medium (Methods in Yeast Genetics, Cold Spring Harbor 
Laboratory, Cold Spring Harbor, N.Y., p. 62 (1983) supplemented with casamino acids at 0.5%, pH 6.5 at 
30°C. Media harvested when the cells had been grown to 36 O.D. contained EPO products at levels of about 
244 Ug/ml (97 ug/OD liter by RIA). Transformed RK81 cells grown to either 6.5 O.D. or 60 O.D. provided 

45 media with EPO concentrations of about 80—90 U/ml (34 ug/OD liter by RIA). Preliminary analyses reveal 
significant heterogeneity in products produced by the expression system, likely to be due to variations in 
glycosylation of proteins expressed, and relatively high mannose content of the associated carbohydrate. 

Plasmids PaC3 and pYE in HB101 Eco/i cells were deposited in accordance with the Rules of Practice of 
the U.S. Patent Office on September 27, 1984, with the American Type Culture Collection, 12301 Parklawn 

so Drive, Rockville, Maryland, under deposit numbers A.T.C.C. 39881 and A.T.C.C. 39882, respectively. 
Plasmids pCFM526 in AM7 cells, pCFM536 in JM103 cells, and pMW1 in JM103 cells were likewise 
deposited on November 21, 1984 as A.T.C.C. 39932, 39934, and 39933, respectively. Saccharomyces 
cerevisiae strains YSPD4 and RK81 were deposited on November 21, 1984 as A.T.C.C. 20734 and 20733, 
respectively. 

ss It should be readily apparent from consideration of the above illustrative examples that numerous 
exceptionally valuable products and processes are provided by the present invention in its many aspects. 

Polypeptides provided by the invention are conspicuously useful materials, whether they are 
microbialry expressed products or synthetic products, the primary, secondary or tertiary structural 
conformation of which was first made known by the present invention. 

60 As previously indicated, recombinant-produced and synthetic products of the invention share, to 
varying degrees, the in vitro biological activity of EPO isolates from natural sources and consequently are 
projected to have utility as substitutes for EPO isolates in culture media employed for growth of 
erythropoietic cells in culture. Similarly, to the extent that polypeptide products of the invention share the 
in vivo activity of natural EPO isolates they are conspicuously suitable for use in erythropoietin therapy 

65 procedures practiced on mammals, including humans, to develop any or all of the effects heretofore 
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attributed in vivo to EPO, e.g., stimulation of reticulocyte response, development of ferrokinetic effects 
(such as plasma from turnover effects and marrow transit time effects), erythrocyte mass changes, 
stimulation of hemoglobin C synthesis (see, Eschbach, et al., supra) and, as indicated in Example 10, 
increasing hematocrit levels in mammals. Included within the class of humans treatable with products of 

5 the invention are patients generally requiring blood transfusions and including trauma victims, surgical 
patients, renal disease patients including dialysis patients, and patients with a variety of blood composition 
affecting disorders, such as hemophilia, sickle cell disease, physiologic anemias, and the like. The 
minimization of the need for transfusion therapy through use of EPO therapy can be expected to result in 
reduced transmission of infectious agents. Products of the invention, by virtue of their production by 

w recombinant methods, are expected to be free of pyrogens, natural inhibitory substances, and the like, and 
are thus likely to provide enhanced overall effectiveness in therapeutic processes vis-a-vis naturally derived 
products. Erythropoietin therapy with products of the present invention is also expected to be useful in the 
enhancement of oxygen carrying capacity of individuals encountering hypoxic environmental conditions 
and possibly in providing beneficial cardiovascular effects. 

15 A preferred method for administration of polypeptide products of the invention is by parenteral (e.g., 
IV, IM, SC, or IP) routes and the compositions administered would ordinarily include therapeutically 
effective amounts of product in combination with acceptable diluents, carriers and/or adjuvants. 
Preliminary pharmacokinetic studies indicate a longer half-life in vivo for monkey EPO products when 
administered IM rather than IV. Effective dosages are expected to vary substantially depending upon the 

20 condition treated but therapeutic doses are presently expected to be in the range of 0.1 (~-7U) to 100 
(-7000U) u.g/kg body weight of the active material. Standard diluents such as human serum albumin are 
contemplated for pharmaceutical compositions of the invention, as are standard carriers such as saline. 

Adjuvant materials suitable for use in compositions of the invention include compounds 
independently noted for erythropoietic stimulatory effects, such as testoste rones, progenitor cell 

25 stimulators, insulin-like growth factor, prostaglandins, serotonin, cyclic AMP, prolactin and triiodo- 
thyronine, as well as agents generally employed in treatment of aplastic anemia, such as methenolene, 
stanozolol and nandrolone [see, e.g., Resegotti, et al., Panminerva Medica, 23, 243 — 248 (1981); McGonigle, 
et al., Kidney Int., 25(21 437-^444 (1984); Pavlovic-Kantera, et al., Expt. HematoL, 8(Supp. 8), 283—291 
(1980); and Kurtz, FEBS Letters, 14a(1), 105 — 108 (1982)}. Also contemplated as adjuvants are substances 

30 reported to enhance the effects of, or synergize, erythropoietin or asialo-EPO, such as the adrenergic 
agonists, thyroid hormones, androgens and BPA [see, Dunn, "Current Concepts in Erythropoiesis", John 
Wiley and Sons (Chichester, England, 1983); Weiland, et al., Blut 44(3), 173—175 (1982); Kalmanti, Kidney 
lnt e 22, 383—391 (1982); Shahidi, New Eng. J. Med, 289, 72—80 (1973); Fisher, et al.„ Steroids, 30(6), 
833—845 (1977); Urabe, et al., J. Exp. Med., 143, 1314—1325 (1979); and Billat, et al., Expt HematoL, 10(1), 

35 133 — 140 (1982)] as well as the classes of compounds designated "hepatic erythropoietic factors" [see, 
Naughton, et al., Acta. Haemat, 69, 171—179 (1983)] and "erythrotropins" [as described by Congote, et at. 
in Abstract 364, Proceedings 7th International Congress of Endocrinology (Quebec City, Quebec, July 1 — 7, 
1984); Congote, Biochem. Biophys. Res. Comm., 115(2), 447 — 483 (1983) and Congote, Anal. Biochem., 140, 
428 — 433 (1984)] and "erythrogenins" [as described in Rothman, et al., J. Surg. Qncot., 20, 105—108 

40 (1982)]. Preliminary screenings designed to measure erythropoietic responses of ex-hypoxic polycythemic 
mice p re-treated with either 5-a-dihyd rote stoste rone or nandrolone and then given erythropoietin of the 
present invention have generated equivocal results. 

Diagnostic uses of polypeptides of the invention are similarly extensive and include use in labelled and 
unlabelted forms in a variety of immunoassay techniques including RIA's, ELISA's and the like, as well as a 

45 variety of vitro and in vivo activity assays. See, e.g., Dunn, et al., Expt HematoL, 11(7), 590—600 (1983); 
Gibson, et al., Pathology, 16, 155—156 (1984); Krystal, Expt. HematoL, 11(7), 649—660 (1983); Saito, et al., 
Jap. J. Med., 23(1), 16—21 (1984); Nathan et al., New Eng. J. Med, 308(9), 520—522 (1983); and various 
references pertaining to assays referred to therein. Polypeptides of the invention, including synthetic 
peptides comprising sequences of residues of EPO first revealed herein, also provide highly useful pure 

so materials for generating polyclonal antibodies and "banks" of monoclonal antibodies specific for differing 
continuous and discontinuous epitopes of EPO. As one example, preliminary analysis of the amino acid 
sequences of Table VI in the context of hydropathicity according to Hopp, et al., P.N.A.S. (U.S.A.), 78, pp. 
3824—3828 (1981) and of secondary structures according to Chou, et al., Ann. Rev. Biochem., 47, p. 251 
(1978) revealed that synthetic peptides duplicative of continuous sequences of residues spanning positions 

55 41 — 57 inclusive, 116—118 inclusive and 144 — 166 inclusive are likely to produce a highly antigenic 
response and generate useful monoclonal and polyclonal antibodies immunoreactive with both the 
synthetic peptide and the entire protein. Such antibodies are expected to be useful in the detection and 
affinity purification of EPO and EPO-related products. 

Illustratively, the following three synthetic peptides were prepared: 

60 

(1) hEPO 41-57, V-P-D-T-K-V-N-F-Y-A-W-K- 

R-M-E-V-G; 

(2) hEPO 116-128, K-E-A-I-S-P-P-D-A-A-S-A-A; 
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(3) hEPO 144-166, V-Y-S-N-F-L-R-G-K-L-K-L- Y- 

T-G-E-A-C-R-T-G-D-R. 

s Preliminary immunization studies employing the above-noted polypeptides have revealed a relatively 
weak positive response to hEPO 41—57, no appreciable response to hEPO 116—128, and a strong positive 
response to hEPO 144—146, as measured by capacity of rabbit serum antibodies to immunoprecipitate 125 l- 
labelled human urinary EPO isolates. Preliminary in vivo activity studies on the three peptides revealed no 
significant activity either alone or in combination. 

10 While the deduced sequences of amino acid residues of mammalian EPO provided by the illustrative 
examples essentially define the primary structural conformation of mature EPO, it will be understood that 
the specific sequence of 165 amino acid residues of monkey species EPO in Table V and the 166 residues of 
human species EPO in Table VI do not limit the scope of useful polypeptides provided by the invention. 
Comprehended by the present invention are those various naturally-occurring allelic forms of EPO which 

is past research into biologically active mammalian polypeptides such as human y interferon indicates are 
likely to exist. {Compare, e.g., the human immune interferon species reported to have an arginine residue 
at position No. 140 in EPO published application 0 077 670 and the species reported to have glutamine at 
position No. 140 in Gray, et al., Nature, 295, pp. 503—508 (1982). Both species are characterized as 
constituting "mature" human y interferon sequences.) Allelic forms of mature EPO polypeptides may vary 

20 from each other and from the sequences of Tables V and VI in terms of length of sequence and/or in terms 
of deletions, substitutions, insertions or additions of amino acids in the sequence, with consequent 
potential variations in the capacity for glycosylation. As noted previously, one putative allelic form of 
human species EPO is believed to include a methionine residue at position 126. Expectedly, naturally- 
occurring allelic forms of EPO-encoding DNA and cDNA sequences are also likely to occur which code for 

25 the above-noted types of allelic polypeptides or simply employ differing codons for designation of the 
same polypeptides as specified. 

In addition to naturally-occurring allelic forms of mature EPO, the present invention also embraces 
other "EPO products" such as polypeptide analogs of EPO and fragments of "mature" EPO. Following the 
procedures of the above-noted published application by Alton, et al. (WO/83/04053) one may readily design 

30 and manufacture genes coding for microbial expression of polypeptides having primary conformations 
which differ from that herein specified for mature EPO in terms of the identity or location of one or more 
residues (e.g., substitutions, terminal and intermediate additions and deletions). Alternately, modifications 
of cDNA and genomic EPO genes may be readily accomplished by well-known site-directed mutagenesis 
techniques and employed to generate analogs and derivatives of EPO. Such EPO products would share at 

35 least one of the biological properties of EPO but may differ in others. As examples, projected EPO products 
of the invention include those which are foreshortened by e.g., deletions [Asn 2 , des-Pro 2 through He 6 ]hEPO, 
[des-Thr 163 through Arg 166 ]hEPO and "A27— 55hEPO", the latter having the residues coded for by an entire 
exon deleted; or which are more stable to hydrolysis (and, therefore, may have more pronounced or longer 
lasting effects than naturally-occurring EPO); or which have been altered to delete one or more a potential 

40 sites for glycosylation (which may result in higher activities for yeast-produced products); or which have 
one or more cystein residues deteted or replaced by, e.g., histidine or serine residues (such as the analog 
[His 7 ]hEPO) and are potentially more easily isolated in active form from microbial systems; or which have 
one or more tyrosine residues replaced by phenylalanine (such as the analogs [Phe 15 ]hEPO, (Phe 49 ]hEPO, 
and (Phe 145 ]hEPO) and may bind more or less readily to EPO receptors on target cells. Also comprehended 

45 are polypeptide fragments duplicating only a part of the continuous amino acid sequence or secondary 
conformations within mature EPO, which fragments may possess one activity of EPO (e.g., receptor 
binding) and not others (e.g., erythropoietic activity). Especially significant in this regard are those potential 
fragments of EPO which are elucidated upon consideration of the human genomic DNA sequence of Table 
VI, i.e., "fragments" of the total continuous EPO sequence which are delineated by intron sequences and 

so which may constitute distinct "domains" of biological activity. It is noteworthy that the absence of in vivo 
activity for any one or more of the "EPO products" of the invention is not wholly preclusive of therapeutic 
utility (see, Weiland, et al., supra) or of utility in other contexts, such as in EPO assays or EPO antagonism. 
Antagonists of erythropoietin may be quite useful in treatment of polycythemias or cases of over- 
production of EPO [see, e.g., Adamson, Hosp. Practice, 78(121, 49—57 (1983), and Hellman, et al., Clin. Lab. 

5 5 Haemat, S, 335—342 (1983)]. 

According to another aspect of the present invention, the cloned DNA sequences described herein 
which encode human and monkey EPO polypeptides are conspicuously valuable for the information which 
they provide concerning the amino acid sequence of mammalian erythropoietin which has heretofore been 
unavailable despite decades of analytical processing of isolates of naturally-occurring products. The DNA 

60 sequences are also conspicuously valuable as products useful in effecting the large scale microbial 
synthesis of erythropoietin by a variety of recombinant techniques. Put another way, DNA sequences 
provided by the invention are useful in generating new and useful viral and circular plasmid DNA vectors, 
new and useful transformed and transfected microbial procaryottc and eucaryotic host cells (including 
bacterial and yeast cells and mammalian cells grown in culture), and new and useful methods for cultured 

55 growth of such microbial host cells capable of expression of EPO and EPO products. DNA sequences of the 
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invention are also conspicuously suitable materials for use as labelled probes in isolating EPO and related 
protein encoding cONA and genomic DNA sequences of mammalian species other than human and 
monkey species herein specifically illustrated. The extent to which DNA sequences of the invention will 
have use in various alternative methods of protein synthesis (e.g., in insect cells) or in genetic therapy in 

5 humans and other mammals cannot yet be calculated. DNA sequences of the invention are expected to be 
useful in developing transgenic mammalian species which may serve as eucaryotic "hosts" for production 
of erythropoietin and erythropoietin products in quantity. See, generally, Palmiter, et al.. Science, 
222(46251 809-814(1983). 

Viewed in this light, therefore, the specific disclosures of the illustrative examples are clearly not 

w intended to be limiting upon the scope of the present invention and numerous modifications and variations 
are expected to occur to those skilled in the art. As one example, while DNA sequences provided by the 
illustrative examples include cDNA and genomic DNA sequences, because this application provides amino 
acid sequence information essential to manufacture of DNA sequence, the invention also comprehends 
such manufactured DNA sequences as may be constructed based on knowledge of EPO amino acid 

75 sequences. These may code for EPO (as in Example 12) as well as for EPO fragments and EPO polypeptide 
analogs (i.e., "EPO Products") which may share one or more biological properties of naturally-occurring 
EPO but not share others (or possess others to different degrees). 

DNA sequences provided by the present invention are thus seen to comprehend all DNA sequences 
suitable for use in securing expression in a procaryotic or eucaryotic host cell of a polypeptide product 

20 having at least a part of the primary structural conformation and one or more of the biological properties of 
erythropoietin, and selected from among: (a) the DNA sequences set out in Tables V and VI; (b) DNA 
sequences which hybridize to the DNA sequences defined in (a) or fragments thereof; and (c) DNA 
sequences which, but for the degeneracy of the genetic code, would hybridize to the DNA sequences 
defined in (a) and (b). It is noteworthy in this regard, for example, that existing allelic monkey and human 

25 EPO gene sequences and other mammalian species gene sequences are expected to hybridize to the 
sequences of Tables V and VI or to fragments thereof. Further, but for the degeneracy of the genetic code, 
the SCEPO and ECEPO genes and the manufactured or mutagenized cDNA or genomic DNA sequences 
encoding various EPO fragments and analogs would also hybridize to the above-mentioned DNA 
sequences. Such hybridizations could readily be carried out under the hybridization conditions described 

30 herein with respect to the initial isolation of the monkey and human EPO-encoding DNA or more stringent 
conditions, if desired to reduce background hybridization. 

In a like manner, while the above examples illustrate the invention of microbial expression of EPO 
products in the context of mammalian cell expression of DNA inserted in a hybrid vector of bacterial 
plasmid and viral genomic origins, a wide variety of expression systems are within the contemplation of 

35 the invention. Conspicuously comprehended are expression systems involving vectors of homogeneous 
origins applied to a variety of bacterial, yeast and mammalian cells in culture as well as to expression 
systems not involving vectors (such as calcium phosphate transfection of cells). In this regard, it will be 
understood that expression of, e.g., monkey origin DNA in monkey host cells In culture and human host 
celts in culture, actually constitute instances of "exogenous" DNA expression inasmuch as the EPO DNA 

40 whose high level expression is sought would not have its origins in the genome of the host Expression 
systems of the invention further contemplate these practices resulting in cytoplasmic formation of EPO 
products and accumulation of glycosylated and non-glycosylated EPO products in host cell cytoplasm or 
membranes (e.g., accumulation in bacterial periplasmic spaces) or in culture medium supernatants as 
above illustrated, or in rather uncommon systems such as P. aeruginosa expression systems (described in 

45 Gray, et al.. Biotechnology, 2, pp. 161—165 (1984). 

Improved hybridization methodologies of the invention, while illustratively applied above to DNA/DNA 
hybridization screenings are equally applicable to RNA/RNA and RNA/DNA screening. Mixed probe 
techniques as herein illustrated generally constitute a number of improvements in hybridization processes 
allowing for more rapid and reliable polynucleotide isolations. These many individual processing 

so improvements include: improved colony transfer and maintenance procedures; use of nylon-based filters 
such as GeneScreen and GeneScreen Plus to allow reprobing with same filters and repeated use of the 
filter, application of novel protease treatments {compared, e.g., to Taub, et al. Anal. Biochem., 126, pp. 
222—230 (1982)]; use of very low individual concentrations (on the order of 0.025 picomole) of a large 
number of mixed probes (e.g., numbers in excess of 32); and, performing hybridization and post- 
55 hybridization steps under stringent temperatures closely approaching (i.e., within 4°C and preferably within 
2°C away from) the lowest calculated dissocation temperature of any of the mixed probes employed. These 
improvements combine to provide results which could not be expected to attend their use. This is amply 
illustrated by the fact that mixed probe procedures involving 4 times the number of probes ever before 
reported to have been successfully used in even cDNA screens on messenger RNA species of relatively low 
6o abundancy were successfully applied to the isolation of a unique sequence gene in a genomic library 
screening of 1,500,000 phage plaques. This feat was accomplished essentially concurrently with the 
publication of the considered opinion of Anderson, et al., supra, that mixed probe screening methods were 
. . impractical for isolation of mammalian protein genes when corresponding RNA's are unavailable. 

65- 
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Claims 

1. A DNA sequence for use in securing expression in a procaryotic or eucaryotic host cell of a poly- 
peptide product having at least part of the primary structural confirmation of that of erythropoietin to allow 
possession of the biological property of causing bone marrow cells to increase production of reticulocytes 
and red blood cells and to increase hemoglobin synthesis or iron uptake, said DNA sequence selected from 
the group consisting of: 

(a) the DNA sequences set out in Tables V and VI or their complementary strands; 

(b) DNA sequences which hybridize under stringent conditions to the protein coding regions of the 
DNA sequences defined in (a) or fragments thereof; and 

(c) DNA sequences which, but for the degeneracy of the genetic code, would hybridize to the DNA 
sequences defined in (a) and fb). 

2. A DNA sequence according to Claim 1 encoding human erythropoietin. 

3. A cDNA sequence according to Claim 1 or 2. 

4. A monkey species erythropoietin coding DNA sequence according to Claim 3. 

5. A DNA sequence according to Claim 4 and including the protein coding region set forth in Table V. 

6. A genomic DNA sequence according to Claim 1 or 2. 

7. A human species erythropoietin coding DNA sequence according to Claim 6. 

8. A DNA sequence according to Claim 7 and including the protein coding region set forth in Table VI. 

9. A DNA sequence according to Claim 1 or 2, covalently associated with a detectable label substance. 

10. A DNA sequence according to Claim 9, wherein the detectable label is a radiolabel. 

11. A single-strand DNA sequence according to Claim 9 or 10. 

12. A DNA sequence according to Claim 1, coding for [Phe 15 )hEPO, [Phe 49 ]hEP0, [Phe us ]hEP0, 
[His 7 ]nEPO, [Asn 2 des-Pro 2 through He 6 ]hEPO, [des-Thr 163 through Arg 166 ]hEPO, or (A27— 55]hEPO. 

13. A procaryotic or eucaryotic host cell transformed or transfected with a DNA sequence according to 
any one of Claims 1, 2, 3, 6, 7 and 8, in a manner allowing the host cell to express said polypeptide product. 

14. A transformed or transfected host cell according to Claim 13 which host cell is capable of 
glycosylating said polypeptide. 

15. A transformed or transfected mammalian host cell according to Claim 14. 

16. A transformed or transfected COS cell according to Claim 14. 

17. A transformed or transfected CHO cell according to Claim 14. 

18. A biologically functional circular plasmid or viral DNA vector including a DNA sequence according 
to any one of Claims 1, 2, 3, 6 r 7, 8 or 12. 

19. A procaryotic or eucaryotic host cell stably transformed or transfected with a DNA vector according 
to Claim 18. 

20. A polypeptide having part or all of the primary structural conformation of human or monkey 
erythropoietin as set forth in Table VI or Table V or any allelic variant or derivative thereof possessing the 
biological property of causing bone marrow cells to increase production of recticulocytes and red blood 
cells and to increase hemoglobin synthesis or iron uptake and characterized by being the product of 
procaryotic or eucaryotic expression of an exogenous DNA sequence. 

21 . A polypeptide according to Claim 20 characterized by being the product of eucaryotic expression of 
an exogenous DNA sequence. 

22. A glycoprotein polypeptide according to Claim 20 having an average carbohydrate composition 
which differs from that of human erythropoietin isolated from urinary sources. 

23. A polypeptide according to Claim 20, 21 or 22, wherein the exogenous DNA sequence is a cDNA 
sequence. 

24. A polypeptide according to Claim 20, 21 or 22, wherein the exogenous DNA sequence is a genomic 
DNA sequence. 

25. A polypeptide according to Claim 20, 21 or 22, wherein the exogenous DNA sequence is carried on 
an autonomously replicating circular DNA plasmid or viral vector. 

26. A polypeptide according to any one of Claims 20 to 25, further characterized by being covalently 
associated with a detectable label substance. 

27. A polypeptide according to Claim 26, wherein said detectable label is a radiolabel. 

28. A polypeptide product of the expression in a procaryotic or eucaryotic host cell of a DNA sequence 
according to any of claims 1. 2, 3, 6, 7 and 8. 

29. A process for production of a polypeptide having at least part of the primary structural 
conformation of erythropoietin to allow possession of the biological property of causing bone marrow cells 
to increase production of recticulocytes and red blood cells and to increase hemoglobin synthesis or iron 
uptake, which process is characterized by culturtng under suitable nutrient conditions a procaryotic or 
eucaryotic host cell transformed or transfected with a DNA sequence according to any of claims 1, 2, 3, 6, 7 
and 8 in a manner allowing the host cell to express said polypeptide; and optionally isolating the desired 
polypeptide product of the expression of the DNA sequence. 

30. A process according to Claim 29, characterized by culturing a host cell of any one of Claims 13 to 1 7. 

31. A process according to Claim 29 or 30 for production of a polypeptide of any one of Claims 20 to 25 
and 28. 
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32. A pharmaceutical composition comprising a polypeptide produced in accordance with the process 
of Claim 29, 30 or 31 and a pharmaceutical^ acceptable diluent, adjuvant or carrier. 

33. A pharmaceutical composition according to Claim 32 comprising a polypeptide of any one of 
Claims 20 to 25 and 28. 

34. An antibody substance characterized by immunoreactivity with erythropoietin and with a synthetic 
polypeptide having a primary structural conformation substantially duplicative or a continuous sequence 
of amino acid residues extant in erythropoietin isolated from urinary sources except for any polypeptide 
comprising a sequence of amino acid residues entirely comprehended within sequence, 

A -p-p-R-L-I-C-D-S-R-V-L-E-R-Y*L-L-E-A-K-E-A-E-N-I-T. 

35. An antibody according to Claim 34, which is a monoclonal antibody. 

36. (was 53). An antibody according to Claim 34, which is a polyclonal antibody. 

37. An antibody according to Claim 34, which is immunoreactive with erythropoietin and a synthetic 
polypeptide having the sequence selected from the sequences: 

v _p_ D _ T - K -V-N-F-Y-A-W-K-R-M-E-V-G , 
K-E-A-I-S-P-P-D-A-A-S-A-A, and 

V-y-s-N-F-L-R-G-K-L-K-L-Y-T-G-E-A-C-R-T-G-D-R- 
Patentanspruche 

1. DNA-Sequenz, die dafur verwendet wird, Expression in einer prokaryontischen oder 
eukaryontischen Wirtszelle eines Polypeptidprodukts zu erreichen, das zumindest teilweise die Primar- 
strukturkonformation von Erythropoietin aufweist, um den Besitz der biologischen Eigenschaft zu 
ermoglichen, Knochenmarkszellen zu veranlassen, die Produktion von Rektikulozyten und roten Blut- 
kdrperchen zu steigern und Hamoglobinsynthese oder Eisenaufnahme zu steigern, wobei besagte DNA- 
Sequenz aus der Gruppe ausgewahlt ist die aus: 

(a) den in den Tabellen V und VI dargelegten DNA-Sequenzen oder deren Komplementarstrange; 

(b) DNA-Sequenzen, die unter stringenten Bedingungen zu den proteincodierenden Bereichen der in 
(a) definierten DNA-Sequenzen oder Fragmenten derselben hybridisieren; und 

(c> DNA-Sequenzen, die, ohne die Entartung des genetischen Codes, zu den in (a) und (b) definierten 
DNA-Sequenzen hybridisieren wurden; besteht. 

2. DNA-Sequenz nach Anspruch 1, die Human-Erythropoietin codiert. 

3. cDNA-Sequenz nach Anspruch 1 oder 2. 

4. DNA-Sequenz nach Anspruch 3, die ein Erythropoietin einer Affenart codiert. 

5. DNA-Sequenz nach Anspruch 4, die den in Tabelle V dargestellten proteincodierenden Bereich 
einschlie&t 

6. Genomische DNA-Sequenz nach Anspruch 1 oder 2. 

7. DNA-Sequenz nach Anspruch 6, die ein Erythropoietin einer menschlichen Spezies codiert. 

8. DNA-Sequenz nach Anspruch 7, die den in Tabelle VI dargestellten proteincodierenden Bereich 
einschlie&t. 

9. DNA-Sequenz nach Anspruch 1 oder 2, die kovalent mit einer nachweisbaren Markierungssubstanz 
verbunden ist 

10. DNA-Sequenz nach Anspruch 9, dadurch gekennzeichnet, dafc die nachweisbare Markierung eine 
radioaktive Markierung ist. 

11. Einzelstrangige DNA-Sequenz nach Anspruch 9 oder 10. 

12. DNA-Sequenz nach Anspruch 1, die fur [Phe 16 ]hEPO, [Phe 49 ]hEP0, [Phe 145 ]hEPO, [His 7 ]hEPO, [Asn a 
des-Pro 2 bis lle 6 )hEPO, [des-Thr 163 bis Arg 1S6 ]hE0 oder [A27— 55]hEPO codiert 

13. Prokaryontische oder eukaryontische Wirtszelle, transformiert oder transfiziert mit einer DNA- 
Sequenz nach einem der Anspruche 1 , 2, 3, 6, 7 und 8, in einer Weise, die es der Wirtszelle erlaubt, besagtes 
Polypeptidprodukt zu exprimieren. 

14. Transformierte oder transfizierte Wirtszelle nach Anspruch 13, dadurch gekennzeichnet dafc die 
Wirtszelle in der Lage ist, besagtes Polypeptid zu glykosylieren. 

15. Transformierte oder transfizierte Saugetier-Wirtszelle nach Anspruch 14. 

16. Transformierte oder transfizierte COS-Zelle nach Anspruch 14. 

17. Transformierte oder transfizierte CHO-Zelle nach Anspruch 14. 

18. Biologisch funktioneller Zirkular-Plasmid-oder Virus-DNA-Vektor, der eine DNA-Sequenz nach 
einem der Anspruche 1, 2, 3, 6, 7, 8 oder 12 einschliefit. 

19. Prokaryontische Oder eukaryontische Wirtszelle, die mit einem DNA-Vektor nach Anspruch 18 stabil 
transformiert oder transfiziert ist 

20. Polypeptid, das einen Teil oder die Gesamtheit der PrimSrstrukturonformation von Human- oder 
Affen-Erythropoietin, wie dargestellt in Tabelle VI oder Tabelle V, oder irgendeine allelische Variante oder 
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ein Derivat desselben aufweist, das die biologische Eigenschaft besitzt, Knochenmarkszeilen zu 
veranlassen, die Produktion von Rektikulozyten und roten Blutkorperchen zu steigern und Hemoglobin- 
synthese oder Eisenaufnahme zu steigern, dadurch gekennzeichnet, dad es das Produkt prokaryontischer 
oder eukaryontischer Expression einer exogenen DNA-Sequenz ist. 
5 21. Polypeptid nach Anspruch 20, dadurch gekennzeichnet, daft es das Produkt eukaryontischer 
Expression einer exogenen DNA-Sequenz ist 

22. Glykoprotein-Polypeptid nach Anspruch 20, das eine mitt I ere Kohiehydratzusammensetzung 
aufweist die sich von derjenigen von Human-Erythropoietin unterscheidet das aus Harnquellen isoliert ist. 

23. Polypeptid nach Anspruch 20, 21 oder 22, dadurch gekennzeichnet daft die exogene DNA-Sequenz 
10 eine cDNA-Sequenz ist. 

24. Polypeptid nach Anspruch 20, 21 oder 22, dadurch gekennzeichnet daft die exogene DNA-Sequenz 
eine genomische DNA-Sequenz ist. 

25. Polypeptid nach Anspruch 20, 21 oder 22, dadurch gekennzeichnet daft die exogene DNA-Sequenz 
auf einem autonom replizierenden Zirkular-DNA-Plasmid- oder Virus-Vektor sitzt 

?5 26. Popypeptid nach einem der Anspruche 20 bis 25, weiter dadurch gekennzeichnet daft es kovalent 
mit einer nachweisbaren Markierungssubstanz verbunden ist. 

27. Polypeptid nach Anspruch 26, dadurch gekennzeichnet daft besagte nachweisbarer Markierung ein 
radioaktive Markierung ist. 

28. Polypeptidprodukt der Expression einer prokaryontischen oder eukaryontischen Wirtszelle einer 
20 DNA-Sequenz nach einem der Anspruche 1, 2, 3, 6, 7 und 8. 

29. Verfahren zur Herstellung eines Polypeptids, das zurnindest teilweise die Primarstrukturon- 
formation von Erythropoietin aufweist urn den Besitz der biologischen Eigenschaft zu ermdg lichen, 
Knochenmarkszeilen zu veranlassen, die Produktion von Rektikulozyten und roten Blutkorperchen zu 
steigern und Hamoglobinsynthese oder Eisenaufnahme zu steigern, dadurch gekennzeichnet, daft eine 

25 prokaryontische oder eukaryontische Wirtszelle, die mit einer DNA-Sequenz nach einem der Anspruche 1, 
2, 3, 6, 7 und 8 derart transformiert oder transfiziert worden ist, daft es der Wirtszelle emoglicht ist, 
besagtes Polypeptid zu exprimieren, unter geeigneten Nahrbedingungen kultiviert wird; und daft fakultativ 
das gewunschte Polypeptidprodukt der Expression der DNA-Sequenz isoliert wird. 

30. Verfahren nach Anspruch 29, dadurch gekennzeichnet, daft eine Wirtszelle nach einem der 
so Anspruche 13 bis 17 kultiviert wird. 

31. Verfahren nach Anspruch 29 oder 30 zur Produktion eines Polypeptids nach einem der Anspruche 
20 bis 25 und 28. 

32. Pharmazeutische Zusammensetzung, die ein Polypeptid, das gemaft dem Verfahren nach Anspruch 
29, 30 oder 31 produziert ist, und ein pharmazeutisch annehmbares VerdOnnungsmittel, Adjuvans oder 

35 Tragermittei umfaftt 

33. Pharmazeutische Zusammensetzung nach Anspruch 32, die ein Polypeptid nach einem der 
Anspruche 20 bis 25 und 28 umfaftt. 

34. Antikorpersubstanz, gekennzeichnet durch ImmunoreaktivitSt mit Erythropoietin und mit einem 
synthetischen Polypeptid, das eine Primarstrukturkonformation aufweist, die im wesentlichen zu einer 

40 fortlaufenden Sequenz von Amtnosaureresten duplikativ ist, die in Erythropoietin, das aus Harnquellen 
isoliert ist, vorhanden ist, mit Ausnahme von jedem Polypeptid, das eine Sequenz von Aminosaureresten 
umfaftt die vollstandig in folgender Sequenz enthalten ist 

A-P-P-R-L-I-C-D-S-R-V-L-E-R-Y-L-L-E-A-K-E-A-E-N-I-T, 

45 

35. Antikorper nach Anspruch 34, dadurch gekennzeichnet daft er ein monoklonaler Antikorper ist 

36. Antikorper nach Anspruch 34, dadurch gekennzeichnet daft er ein polyklonaler antikorper ist 

37. Antikorper nach Anspruch 34, der mit Erythropoietin und einem synthetischen Polypeptid 
immunoreaktiv ist das die Sequenz aufweist, die aus den folgenden Sequenzen ausgewahlt ist: 

so 

V-P-D-T-K-V-N-F-Y-A-W-K-R-M-E-V-G, 
K-E-A-I-S-P-P-D-A-A-S-A-A, und 

V-Y-S-N-F-L-R-G-K-L-K-L-y-T-G-E-A-C-R-T-G-D-R* 

DO 

Revendicatlons 

1. Sequence d'ADN utilisable pour garantir I'expression, dans une cellule note procaryote ou 
so eucaryote, d'un polypeptide ayant au moins une partie de la conformation structurelle principale de celle 
de l'6rythropoi6tine pour permettre de posseder la propriete biologique que des cellules de ta moelle 
osseuse provoquent I'augmentation de la production de reticulocytes et de globules rouges et 
I'augmentation de la synthese de I'hemoglobine et du captage du fer, ladite sequence d'ADN etant choisie 
dans le groupe constitue de: 
65 (a) les sequences d'ADN indiquees dans les tableaux V et VI ou leurs brins complementaires. 
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(b) des sequences d'ADN qui s'hybrident dans des conditions strictes dans les regions de codage de 
proteines des sequences d'ADN definies dans <a) ou des fragments de celles-ci, et 

(c) des sequences d'ADN qui, n'etait ia degenerescence du code genetique, s'hybrideraient en les 
sequences d'ADN definies dans (a) et (b). 

2. Sequence d'ADN selon la revendication 1 codant I'erythropoietine humaine. 

3. Sequence d'ADNc selon la revendication 1 ou 2. 

4. Sequence d'ADN de codage de I'erythropoietine de singe selon la revendication 3. 

5. Sequence d'ADN selon la revendication 4 et incluant la region de codage de proteines indiquee dans 
le tableau V. 

6. Sequence d'ADN genomique selon la revendication 1 ou 2. 

7. Sequence d'ADN codant I'erythropoietine humaine selon la revendication 6. 

8. Sequence d'ADN selon la revendication 7 et incluant la region de codage de proteines indiquee dans 
le tableau VI. 

9. Sequence d'ADN selon la revendication 1 ou 2, associee, de fagon covalente, a une substance de 
marquage detectable. 

10. Sequence d'ADN selon la revendication 9, dans laquelle le traceur detectable est un radiotracer. 

11. Sequence d'ADN a brin unique selon la revendication 9 ou 10. 

12. Sequence d'ADN selon la revendication 1, codant pour [Phe 15 )hEPO, [Phe 49 ]hEPO, [Phe 14S lhEP0, 
[His 7 ]nEP0, [Asn 2 des-Pro 2 par lle 6 ]hEPO, |des-Thr 163 par Arg 166 ]hEPO, ou [A27— 55]hEPO. 

13. Cellule note procaryote ou eucaryote transformed, ou soumise a une transfection, par une 
sequence d'ADN selon Tune quelconque des revendications 1, 2, 3, 6, 7 et 8, d'une fagon permettant a la 
cellule note d'exprimer ledit polypeptide. 

14. Cellule note transforrnee, ou soumise a une transfection, selon la revendication 13, laquelle cellule 
note est capable d'effectuer la glycosylation dudit polypeptide. 

15. Cellule note de mammifere transforrnee, ou soumise a une transfection, selon la revendication 14. 

16. Cellule COS transforrnee, ou soumise a une transfection, selon la revendication 14. 

17. Cellule CHO transforrnee, ou soumise a une transfection, selon la revendication 14. 

18. Vecteur d'ADN viral ou plasmide circulate biologiquement fonctionnel incluant une sequence 
d'ADN selon Tuner quelconque des revendications 1, 2, 3, 6, 7, 8 ou 12. 

19. Cellule hdte procaryote ou eucaryote transformed de facon stable, ou soumise a une transfection, 
par un vecteur d'ADN selon la revendication 18. 

20. Polypeptide ayant tout ou partie de la conformation structurelle principale de I'erythropoietine 
humaine ou de singe, comme indiquee dans le tableau VI ou le tableau V, ou un variant allelomorphe 
quelconque ou un de>iv6 de celui-ci possedant la propri6t6 biologique que des cellules de la moelle 
osseuse provoquent I'augmentation de la production de reticulocytes et de globules rouges et 
I'augmentation de la synthese de I'hemoglobine et du captage du far, et caracterise comme etant le produit 
de I'expression procaryote ou eucaryote d'une sequence d'ADN exogene. 

21. Polypeptide selon la revendication 20, caracterise comme etant le produit de I'expression eucaryote 
d'une sequence d'ADN exogene. 

22. Polypeptide de glycoproteine selon la revendication 20, ayant une composition moyenne de carbo- 
hydrates qui differe de celle de I'erythropoietine humaine iso!6e a partir de sources urinaires. 

23. Polypeptide selon la revendication 20, 21 ou 22, dans lequel la sequence d'ADN exogene est une 
sequence d'ADNc. 

24. Polypeptide selon la revendication 20, 21 ou 22, dans lequel la sequence d'ADN exogene est une 
sequence d'ADN genomique. 

25. Polypeptide selon la revendication 20, 21 ou 22, dans lequel la sequence d'ADN exogene est ported 
sur un vecteur viral ou un plasmide d'ADN circulaire se repliquant de facon autonome. 

26. Polypeptide selon I'une quelconque des revendications 20 a 25, caracterise de plus comme etant 
associe, de fagon covalente, a une substance de marquage detectable. 

27. Polypeptide selon la revendication 26, dans lequel ledit traceur detectable est un radiotraceur. 

28. Polypeptide produit par I'expression, dans une cellule note procaryote ou eucaryote, d'une 
sequence d'ADN selon I'une quelconque des revendications 1, 2, 3, 6, 7 et 8. 

29. Proc&te pour produire un polypeptide ayant au moins une partie de la conformation structurelle 
principale de I'erythropoietine pour permettre de posseder la propriete biologique que les cellules de la 
moelle osseuse provoquent I'augmentation de la production de reticulocytes et de globules rouges et 
I'augmentation de la synthese de I'hemoglobine et du captage du fer, lequel procede est caracterise par la 
mise en culture, dans des conditions nutritives appropriees, d'une cellule note procaryote ou eucaryote 
transforrnee, ou soumise a une transfection, par une sequence d'ADN selon I'une quelconque des 
revendications 1, 2, 3, 6, 7 et 8, d'une fagon permettant a la cellule note d'exprimer ledit polypeptide; et 
eventuellement, par I'isolement du polypeptide souhaite produit par I'expression de la sequence d'ADN. 

30. Procede selon la revendication 29, caracterise par ia mise en culture d'une cellule note selon I'une 
quelconque des revendications 13 a 17. 

31. Procede selon la revendication 29 ou 30 pour la production d'un polypeptide selon rune 
quelconque des revendications 20 a 25 et 28. 

32. Composition pharmaceutique comprenant un polypeptide produit conformement au procede de la 
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revendication 29, 30 ou 31 et un support, adjuvant ou diluant pharmaceutiquement acceptable. 

33. Composition p harm ace utique selon la revendication 32, comprenant un polypeptide selon I'une 
quelconque des revendications 20 a 25 et 28. 

34, Substance anticorps, caracterisee par I'immunoreactivite avec I'erythropoietine et avec un poly- 
peptide synthetique ayant une conformation structured principale sensiblement duplicative d'une 
sequence continue de residus d'acides amines existant dans de I'erythropoietine isolee de sources 
unnaires, a I'exception d'un polypeptide quelconque comprenant une sequence de residus d'acides 
amines entierement comprise dans la sequence: 



A-P-P-R-L-I-C-D-S-R-V-L-E-R-Y-L-L-E-A-K-E-A-E-N-I-T. 

35. Anticorps selon la revendication 34, qui est un anticorps monoclonal. 

36. Anticorps selon la revendication 34, qui est un anticorps polyclonal, 

37. Anticorps selon la revendication 34, qui est immunoreactif avec I'erythropoietine et un polypeptide 
synth6tique ayant la sequence choisie a partir des sequences: 

V-P-D-T-K-V-N-F-Y-A-W-K-R-M-E-V-G, 
K-E-A-I-S-P-P-D-A-A-S-A-A, et 

V-Y-S-N-F-L-R-G-K-L-K-L-Y-T-G-E-A-C-R-T-G-D-R, 
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